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Abstract 

In this thesis, I study the hmit of a large number of colors (A'^) in a non-abelian 
gauge theory. It corresponds to a classical limit where fluctuations in gauge-invariant 
observables vanish. The large-dimension limit for rotation-invariant variables in atomic 
physics is given as an example of a classical limit for vector models. 

The baryon is studied in Rajeev's reformulation of two-dimensional QCD in the 
large-N limit: a bilocal classical field theory for color-singlet quark bilinears, whose 
phase space is an infinite grassmannian. In this approach, 't Hooft's integral equation 
for mesons describes small oscillations around the vacuum. Baryons arc topological 
solitons on a disconnected phase space, labelled by baryon number. The form factor 
of the ground-state baryon is determined variationally on a succession of increasing- 
rank submanifolds of the phase space. These reduced dynamical systems are rewritten 
as interacting parton models, allowing us to reconcile the soliton and parton pictures. 
The rank-one ansatz leads to a Hartree-type approximation for colorless valence quasi- 
particles, which provides a relativistic two-dimensional realization of Witten's ideas on 
baryon structure in the 1/N expansion. The antiquark content of the baryon is small 
and vanishes in the chiral limit. The valence-quark distribution is used to model parton 
distribution functions measured in deep inelastic scattering. A geometric adaptation of 
steepest descent to the grassmannian phase space is also given. 

Euclidean large-AT multi-matrix models are reformulated as classical systems for 
U{N) invariants. The configuration space of gluon correlations is a space of non- 
commutative probability distributions. Classical equations of motion (factorized loop 
equations) contain an anomaly that leads to a cohomological obstruction to finding an 
action principle. This is circumvented by expressing the configuration space as a coset 
space of the automorphism group of the tensor algebra. The action principle is in- 
terpreted as the partial Legendre transform of the entropy of operator-valued random 
variables. The free energy and correlations in the N ^ oo limit are determined varia- 
tionally. The simplest variational ansatz is an analogue of mean-field theory. The latter 
compares well with exact solutions and Monte-Carlo simulations of one and two-matrix 
models away from phase transitions. 
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Chapter 1 

Introduction 

1.1 QCD and the Large- Classical Limit 

In this section, we motivate and provide the historical context for the issues studied in 
this thesis. We use analogies with celestial mechanics and atomic physics, since they 
are familiar and more mature physical theories. However, the analogies are not to be 
taken too literally. A summary of the thesis and a short survey of the literature is also 
included in this introduction. 

The Standard Model: Strong and Electroweak Interactions 

At the turn of the twenty-first century, the standard model of particle physics for 
electroweak and strong interactions is at an intermediate stage of development, one that 
is similar to a stage in mechanics after Newton's force laws had been discovered, but 
hamiltonian and lagrangian mechanics were still being developed. The latter are not 
merely elegant repackagings of the force laws in terms of single functions, but provide 
a way to go beyond the two body problem in a systematic manner, and passage to 
continuum mechanics. Moreover, it is difficult to imagine doing statistical mechanics 
without the idea of a hamiltonian. What is more, these developments played a key role 
in the discovery of the "next theory", quantum mechanics. Drawing an analogy with 
atomic physics, we are at a stage after the discovery of quantum mechanics, but before 
its reformulation in terms of path integrals. Path integrals are indispensable to the 
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theoretical development that followed quantum mechanics, quantum field theory. 

Spectacular experimental discoveries, such as patterns found in the hadronic spec- 
trum 0ni EHj ) electroweak gauge bosons |H3 , and scaling in deep inelastic scattering pS] , 
parallel the discovery of the detailed orbits of planets and moons, or of discrete atomic 
spectra. On the other hand, equally deep theoretical discoveries such as the gauge 
principle |lU6j . electroweak unification |lU2j . asymptotic freedom in non-abelian gauge 
theories [S7j, and perturbative renormalizability of unbroken and spontaneously-broken 
gauge theories |56j remind us of the discovery of the inverse-square force law between 
the sun and planets, or of the uncertainty principle and the Schrodinger equation for 
electrons in atoms. 

Based on experiment and guiding principles such as gauge invariance and renormal- 
izability, the lagrangian of the standard model has now been almost completely defined. 
The standard model is based on non-abelian gauge theories (Yang-Mills theories) for the 
force carriers: gluons for the strong force, and the W and Z electroweak gauge bosons, 
which are generalizations of the photon of electromagnetism. These bosons couple to 
fermionic matter particles, the quarks and leptons (which include the electron). Finally, 
an untested part of the standard model consists of scalar fields that are expected to 
provide spontaneous breaking of the electroweak gauge symmetry. 

Quantum Chromodynamics (QCD) is the unbroken non-abelian gauge theory de- 
scribing the interactions of quarks and gluons. In a gauge theory, the basic gluon and 
quark fields have some redundant degrees of freedom that are not observable. The true 
dynamical degrees of freedom (the "hadrons" ) are determined by the principle of gauge 
invariance. In QCD, electric charge is replaced by color charge of quarks and gluons. 
Unlike photons, which are electrically neutral, gluons carry color charge. QCD also has 
the property of asymptotic freedom which, at high momentum transfers, suggests that 
quarks and gluons behave as free particles (up to logarithmic corrections). This is in 
contrast to the electrostatic force between charges or the gravitational force between 
planets and the sun, which grow stronger at short distances. On the other hand, the 
strong force does not fall off at large distances, but leads instead to the phenomenon of 
confinement. In fact, the "fundamental particles", quarks and gluons, have never been 
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isolated. Unlike the electron, which can be removed from an atom by ionization, quarks 
and gluons seem to be confined inside (colorless, gauge-invariant) bound states of the 
strong interactions. These are called mesons (such as the pion) and baryons (such as 
the proton and neutron). In addition, there are glueballs, which are gauge-invariant 
bound-states of gluons alone. However, there is no quantum number that distinguishes 
glueballs from mesons. These are collectively referred to as hadrons. 

The success of perturbation theory around the h classical limit 

Non-abelian gauge theories, being quantum field theories, can be illuminated through 
perturbation around a classical limit. A classical limit is one in which the fluctuations 
in some observables vanish. In the limit of vanishing Planck's constant, the fluctuations 
in all observables of a gauge theory, not just those that are gauge- invariant, vanish. 
So it is not surprising that the loop expansion around the free-field classical limit of 
^ — > ("perturbation theory") has been tremendously successful in making quantitative 
predictions about electroweak interactions. The latter are not just weak at currently 
accessible energies, but also involve a spontaneously-broken gauge symmetry. Thus, all 
states, not just the gauge-invariant ones, and in particular, the gauge bosons W^,Z^, 
are observable. 

As for the strong interactions, because of asymptotic freedom, perturbation theory 
makes accurate predictions about some aspects of their high-energy behavior. In partic- 
ular, logarithmic scaling violations, and other phenomena where quarks and gluons can 
be treated as almost free, are well described by perturbation theory around the h ^ 
limit. Indeed, QCD is the only known renormalizable four-dimensional theory with the 
correct high-energy behavior. This is primarily why it is believed to be the correct the- 
oretical model for strong interactions. (The quantum theory can also be studied around 
non-trivial classical solutions in the h ^ limit, such as solitons and instantons '85]). 

Why study QCD? What is left to do? 

The discovery of the basic lagrangian, and the verification of its predictions for short 
distances, is only the first step in understanding dynamics. The finite-time and asymp- 
totic behavior of solutions of the differential equations of classical mechanics hold a lot 
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of information and surprises not evident over short times. In celestial mechanics, elhp- 
tical orbits of planets could be derived easily from Newton's equations of motion. But 
the effects of Jupiter on the motion of other planets, or the estimation of the perihe- 
lion shift of Mercury, required reformulations of the theory and the development of new 
approximation methods, such as classical perturbation theory. These eventually led to 
the discovery of general relativistic corrections to newtonian gravity. Another example 
is the study of the stability of the solar system, which in turn led to the development of 
chaotic dynamics and Kolmogorov- Arnold- Moser theory [2]. 

We have a similar situation in QCD. Perturbation theory around the trivial vacuum 
in the 7i — > limit allows us to examine the strong interaction at only short distances, 
where quarks and gluons are almost free. This tells us very little about the long-distance 
behavior or about bound states of quarks and gluons. In fact, the problem is quite severe, 
since all observed particles are hadronic bound states. A mechanism that explains both 
qualitatively and quantitatively the confinement of quarks and gluons within hadrons 
has not been found. The empirically deduced long-range linear potential between quarks 
has also not been derived from QCD. How quarks and gluons bind to form hadrons, in 
other words the structure functions of these bound states (such as the proton), cannot 
be understood by merely perturbing around the vacuum. Though gluons are massless, 
the lightest observed particle of the theory is massive. Understanding this mass gap in 
the hadronic spectrum is an outstanding challenge. There are other peculiarities in the 
mass spectrum of hadrons. For example, the linearly rising Regge trajectories, where 
the angular momenta of hadronic resonances (short-lived particles) are linearly related 
to the squares of their masses, suggests that quarks are held together by a string with 
constant tension |44j . Chiral symmetry-breaking is yet another phenomenon one would 
like to establish in QCD. Perhaps its most dramatic manifestation is that practically 
massless quarks (up and down, 5 — 7 MeV) bind to form the proton, which is over 
a hundred times as heavy (938 MeV). The situation is such that, even though QCD 
is believed to be the correct theory, it has not been possible, aside from large-scale 
numerical simulations and certain non-relativistic situations involving mesons with two 
heavy quarks, to calculate the mass or wave function (structure function) of even a 
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single observed particle in the theory! However, there are many simplified models such 
as dimensional reductions, strong coupling expansions, low energy effective theories or 
supersymmetric generalizations that exhibit some of these features. One hopes that 
further progress can be made by combining new physical and mathematical ideas. Of 
course, there is a wealth of experimental data |HH] and increasingly accurate numerical 
simulations (see for example |25l I98j ^ with which to compare. Finally, we can hope that 
a deeper understanding of QCD will lead us to physical principles and mathematical 
structures, that are needed for any future physical theory that observation may require 
us to invent. For example, a deep appreciation for the hamiltonian and Poisson bracket 
formulation of classical mechanics led Dirac to invent the canonical formalism, which 
is more convenient for quantum field theory than the original Schrodinger equation of 
single particle quantum mechanics. 
Alternative Classical Limits 

Quantum theories often have more than one classical limit in which some observ- 
ables do not fluctuate. Usually, each is characterized by assuming a limiting value for 
some parameter. Different classical limits are useful for formulating different phenomena 
characterizing the same underlying quantum theory. For instance, in atomic physics, the 
^ — > classical limit gives no indication of why the atom is stable. However, there are 
other classical limits in atomic physics. The limit of a large number of dimensions is 
a classical limit where rotationally invariant variables do not fluctuate. This limit pro- 
vides a "classical" explanation for why the electron does not fall to the minimum of the 
Coulomb potential (see ^H])- Atomic Hartree-Fock theory is yet another classical limit 
that provides a way of studying many-electron atoms [HZJ . 

QCD has a different classical limit from ?l ^ 0. This is the limit of a large number 
of colors N, where the structure group of the gauge theory is SU{N). There appear 
to be three colors in nature. Fluctuations in gauge-invariant observables alone vanish 
in the large-A^ limit. Thus, we should expect the large-A^ classical limit to be a better 
approximation than the h ^ limit for unbroken gauge theories with strongly-coupled 
gauge fields, where it is only the gauge invariant states that are observed. This is the 
regime of interest for the strong-interaction phenomena mentioned previously. The large- 
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limit, as an approximation for non-abelian gauge theories, was originally proposed by 't 
Hooft in a perturbative context [S^. In this limit, planar Feynman diagrams were found 
to dominate, 't Hooft calculated the spectrum of mesons in the large- limit of two- 
dimensional QCD by summing such planar diagrams |53j, and found an infinite tower of 
bound states, and the analogs of linearly-rising Regge trajectories. The planar diagrams 
also suggested a connection to a string model of hadrons which was expected from the 
linearly rising potential and Regge trajectories. We will see later that summing such 
planar diagrams only gives the linear approximation around the vacuum of the large- 
limit. Nevertheless, several other empirical facts about the strong interaction also seemed 
likely to be accommodated in the large- limit (see Witten jlL)4j l. For example, Zweig's 
rule, an empirical deduction concerning, for instance, the mixing of mesons with glue or 
exotic states, is exact in the linear approximation to the large- A^ limit where mesons are 
pure qq states jlU4j . Thus, there are theoretical and phenomenological reasons to expect 
that A^ = oo is a good starting point for an approximation to QCD, though only A^ = 3 
corresponds to nature. 

Despite the success of the diagrammatic point of view, it obscures the classical na- 
ture of the large- A^ limit. Though it is believed that the large- A^ limit of QCD is a 
classical limit, its formulation as a classical dynamical system is not well understood. In 
particular, we would like to identify the classical configuration space, the equations of 
motion and action or hamiltonian, and also develop the approximation methods required 
to understand the theory. In this thesis, we present some investigations of the large- A^ 
limit as a classical limit for two simplified models of QCD: the quark structure of baryons 
in two-dimensional QCD, and matrix models as a simple model for gluon structure. 
More on why QCD is difficult: Divergences and Gauge Invariance 

Three of the many aspects that make QCD hard to understand are the large num- 
ber of degrees of freedom, gauge invariance, and ultraviolet divergences. As currently 
formulated, QCD is a divergent quantum field theory. The only known way to make 
finite predictions in four space-time dimensions, aside from the numerical lattice-QCD 
approach, is by the procedure of perturbative renormalization [^. While very success- 
ful, this procedure is inherently tied to the perturbative solution of the theory. Rather 
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than address the important question of finding an alternative finite formulation of the 
theory, we will work with regularized versions of the theory. One such regularization is a 
matrix model, which reduces the number of degrees of freedom. We will also work with 
a reduction to two dimensions, where the theory is ultraviolet finite without any need 
for regularization. This will allow us to focus attention on the difficulties arising from 
gauge invar iance. 

In a gauge theory, we have the unusual situation where a physical theory is formu- 
lated in terms of unobservable particles, the quarks and gluons. To accommodate the 
properties of the observed hadrons, it is then necessary to rewrite the theory in terms 
of gauge-invariant observables. The principle of gauge invariance is no longer relevant 
when we work with a gauge-invariant formulation of the theory. We must look for other 
geometric, probabilistic, or algebraic principles that play as important a role. This is 
still a formidable task in the full 3-1-1 dimensional theory. So we address this question in 
the simpler contexts mentioned above. Quarks transform as A^-component vectors under 
color, the SU {N) structure group, while gluons transform in the adjoint representation 
as N X N hermitian matrices. The components of these vectors and the matrix elements 
are the so-called "gauge degrees of freedom" (quarks and gluons) that carry the color 
quantum number, and are not directly observable. Only color-invariant combinations 
are observable. Furthermore, gauge-invariant observables (the Wilson loop and meson 
observables) are non-local, which means that we need to make the passage from local 
gauge fields to non-local loop or string-like variables. 
The Baryon in the Large- Limit of 2d QCD 

Vector models have fewer degrees of freedom, making them easier to deal with than 
matrix models. We therefore devote the first half of this thesis to the vector model of 
quarks in two-dimensional QCD, interacting via a linear potential due to longitudinal 
gluons. 't Hooft's work on two-dimensional QCD left a puzzle as to how to handle 
baryons. An early proposal of Skyrme was that baryons must arise as solitons in a 
theory of mesons |95j . However, 't Hooft's equation for mesons was linear, and did not 
support soliton solutions. Rajeev discovered a bilocal (along a null line) reformulation of 
two-dimensional QCD in terms of color-invariant quark bilinears. In the large- A'^ classical 
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limit, it is a hamiltonian dynamical system whose phase space is an infinite dimensional 
grassmannian manifold. The phase space is a curved manifold because the quark density 
matrix is a projection operator, making this theory strongly interacting even in the large- 

limit. This is an indication of the type of geometric ideas that may play a role in 
a gauge-invariant reformulation of QCD. 't Hooft's meson spectrum is recovered as a 
linear approximation to the equations of motion on the grassmannian around the naive 
vacuum. Moreover, the phase space is disconnected, with components labelled by an 
integer-valued baryon number. The baryon arises as a topological soliton, and we study 
this in this thesis. In particular, we determine the "shape", or more precisely, the form 
factor, which tells us, roughly, how quarks are distributed inside the proton, though in 
a color invariant manner, without explicit reference to color carrying quarks. 

We also examine another puzzle regarding the structure of the proton: the relation 
between the soliton and parton pictures. The parton model is a complementary view to 
the soliton picture of the baryon I17j . Deep inelastic scattering experiments ;^38 ■ in- 
dicate that the constituents of the proton, called partons, are point-like. This is because 
the proton structure function is roughly scale invariant, i.e., independent of the length 
scale at which it is observed. This was accommodated in the parton model by postulat- 
ing that the proton is made of point-like partons. The latter had to be non- interacting in 
order to match the observed high-energy behavior. Partons were identified with quarks 
and gluons after the discovery of QCD. Their non-interacting behavior at high ener- 
gies was regarded as a consequence of their asymptotic freedom. The QCD-improved 
parton model (perturbative QCD) made accurate predictions for how scale invariance 
is broken, i.e., the logarithmic scale dependence of structure functions. However, the 
dependence of structure functions on the momentum fraction carried by a parton, which 
is akin to an atomic wave function, is non-perturbative and remained inaccessible. In 
Part m of this thesis, we derive an interacting parton model from the solitonic point of 
view, thereby reconciling the two disparate points of view in two space-time dimensions. 
We also determine the non-perturbative momentum-fraction dependence of the quark 
structure function within two-dimensional QCD. This two-dimensional theory provides 
a good approximation to Deep Inelastic Scattering, and we use it to model the quark 
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distributions measured in experiment f ^3.5|) . 
Large- Matrix Models 

Our analysis of quark structure still leaves the gluon distribution undetermined. Glu- 
ons are the force carriers between quarks, just as photons transmit the force between 
electric charges in QED. However, QED is an abelian gauge theory while QCD is a 
non-abelian gauge theory, leading to strong self interactions between gluons. One man- 
ifestation of this is in the ability of gluons (unlike photons) to form hadronic bound 
states, referred to as glueballs. The non-abelian nature of gluons also leads to another 
phenomenon that makes the strong interactions very different from the quantum me- 
chanics of an atom. The electron wave function determines the shape of an atom, and, 
within the non-relativistic theory, the photon does not carry any momentum of an atom. 
But for a proton, when the momentum transferred by the probe is about 1 GeV, it is 
an experimental fact that quarks and gluons carry roughly equal portions of its total 
momentum The contribution from gluons only grows with the probe's momentum 
transfer 

Of the four possible states of polarization of the gluon in four dimensions, only two 
are dynamical i.e. its transverse states. The longitudinal and time components can 
be eliminated by gauge fixing and solving the resulting constraint equation. Within 
two-dimensional QCD, there are no transverse gluons, and they were ignored previously. 
Now we turn to dynamical gluons, namely, the propagating gluon degrees of freedom 
which cannot be eliminated by gauge fixing. In Part ^ of this thesis, we study euclidean 
multi- matrix models in the large- classical limit. They are a regularized version of 
gluon dynamics. Understanding gluon dynamics is much harder than quark dynamics. 
The reason is that there are many more gauge-invariant gluon observables than pure 
quark observables. The dot products of pairs are the only invariants of a set of vectors, 
while the traces of arbitrary products of a collection of matrices (the gluon correlation 
tensors) are all unitary invariants. 

It is believed that the large-A^ limit of matrix models comprises a classical theory. 
In this thesis, we identify this classical theory in a manifestly unitary invariant manner. 
We show that the configuration space is a space of non-commutative probability distri- 
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butions. The gluon field is an x matrix-valued random variable. These matrices at 
different points of space-time do not commute, and as a consequence we are dealing with 
a non-commutative version of probability theory (Chapter 0}. The coordinates on the 
configuration space are gluon correlations. Their fluctuations vanish in the large- limit, 
and satisfy the factorized Schwinger-Dyson (or loop) equations. We identify these as the 
classical equations of motion f ^5.1() . However, a classical action is very elusive. There 
is an anomaly in the equations arising from the transformation from matrix elements to 
invariants, manifested as a cohomological obstruction to finding an action on the config- 
uration space f ^5.2.1|) . We circumvent this obstruction by expressing the configuration 
space as a coset space of a non-commutative analogue of the diffeomorphism group by an 
isotropy subgroup. We then find a classical action on the group that is invariant under 
the action of the isotropy subgroup. So, the action really lives on the quotient, i.e., the 
configuration space ( H5.2.2() . Our search for principles that determine a gauge- invariant 
formulation of large- matrix models leads us to the automorphism group of the free 
algebra and its first cohomology! 

What is more, we find that the entropy of non-commutative probability theory, de- 
veloped as a branch of operator algebra by Voiculescu and collaborators plays a 
central role. Whenever we restrict the allowed observables of a physical system, we 
should expect an entropy due to our lack of knowledge of those observables. For exam- 
ple, entropy in statistical mechanics arises because we do not measure the velocities of 
individual gas molecules but only macroscopic variables such as pressure and density. 
Similarly, confinement of the color degrees of freedom should lead to an entropy in the 
strong interactions. In a matrix model, expressing the action in terms of unitary invari- 
ant gluon correlations rather than unobservable matrix elements, leads to an entropy. 
We find ( H5.3|) a variational principle for the solution of the classical equations of motion 
for gluon correlations by maximizing entropy subject to certain constraints. 
Approximation Methods 

Progress in newtonian mechanics came about through at least two ways: (1) general 
approximation methods, or alternative formulations of the theory, and (2) exact solutions 
of some special systems. Examples of (1) are hamiltonian and lagrangian mechanics. 
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Hamilton-Jacobi theory, perturbation theory and variational principles. Examples of (2) 
are Jacobi's solution of the rigid body, and the theory of integrable systems. 

A common theme throughout this thesis is the use of approximation methods to 
solve the classical theories we get in the large-A^ limit. A classical limit is itself an 
approximation. However, both the classical theories we get, dynamics on an infinite 
grassmannian for quarks in two-dimensional QCD, and the maximization of entropy 
in multi-matrix models, are highly non-linear and non-local classical theories. They 
require the development of new non-perturbative approximation methods. Here, we 
take inspiration from the approximation methods of classical mechanics, atomic physics, 
and many-body theory. We look for analogs of variational principles, mean-field theory, 
Hartree-Fock theory, steepest descent, and the loop expansion. 

Finding the ground-state of the baryon involves minimizing its energy on an infinite 
dimensional curved phase space. The first method we develop is a geometric adaptation 
of steepest descent for a curved phase space (3S^- Then we find a variational approxi- 
mation method that replaces the full phase space with finite dimensional submanifolds, 
and we study the dynamical system on reduced phase spaces ( ^3.21 ^3.4|) . In the sim- 
plest case, we get an analogue of mean-field theory (Hartree-Fock theory) for a system 
of interacting colorless quasi-particles. This is how we are able to derive the interacting 
valence parton picture from the exact soliton description of the baryon ( ^3.2.21 H3.2.3() . 
We also show how to go beyond this, and include anti-quarks ( ^l-i.4.2|) . Though the em- 
phasis is on approximate solutions, along the way we also find the exact form factor of 
the baryon in the large- limit of two-dimensional QCD for massless current quarks (see 
(jSiHHl))- We compare our approximate solution for the quark distribution function with 
numerical calculations |HH] and also measurements from Deep Inelastic Scattering, and 
find good agreement (see ij3.5|) . 

Since the 1980s significant progress was made in finding exact solutions for partition 
functions and special classes of correlations of carefully chosen matrix models, such as two 
matrix models with specific interactions, matrix chains associated with Dynkin diagrams 
of simply-laced Lie algebras etc. The methods often originated in the theory of integrable 
systems or conformal field theory (see for example j78 | \77 \ UTTl 1661 l,S4j ) . However, there 



Chapter^ Introduction 



12 



is a lack of approximation methods to handle generic matrix models in the large-A^ 
limit. Even the analogs of simple methods such as mean- field theory and variational 
principles were previously not known. We find a variational principle that allows us to 
determine the gluon correlations from a finite-parameter family that best approximate 
the correlations of a given matrix model ( ^5.2|) . We use this variational principle to find 
an analog of mean- field theory for large- multi-matrix models f ^5.4() . and also indicate 
how one can go beyond mean-field theory ( TO.3|) . These approximation methods compare 
favorably with exact solutions, and with a Monte-Carlo simulation away from divergences 
in the free energy f ^5.4.11 ^5.4.2() . For other approaches to solving matrix models, see 
for instance the work of J. Alfaro et. al. 0. 
Literature on Matrix Models in High Energy Physics 

Since the 1970s, there has been a great deal of work done on large- matrix models. 
Some of the earlier papers are reproduced in the collection of Ref. ^HI- We mention a 
few of the many developments. Brezin, Itzykson, Parisi and Zuber studied the euclidean 
one-matrix model using the saddle point method in the large- A^ limit. They found an 
important relation between the quantum mechanics of a single matrix in the large-A^ 
limit and a system of free fermions J^. Migdal and Makeenko found that the Wilson 
loops of a large- A^ gauge theory satisfy a closed set of "factorized loop" equations |74j . 
Yaffe's coherent states approach |1()5| . and Sakita's and Jevicki's jHiB 1021 work on the 
collective field formalism of large- A^ field theories was an important step in understanding 
an anomaly in the hamiltonian of the large- A'^ limit. The anomaly is one of the main 
differences relative to the h ^ classical hamiltonian. Our recent work shows that this 
anomaly is in fact the non-commutative analogue of Fisher information of probability 
theory pQ. The papers of Cvitanovic and collaborators |261 127j on planar analogues of 
some of the familiar methods of field theory, but with non-commutative sources was 
helpful in our algebraic formulation of the problem. Eguchi's and Kawai's [HSj proposal 
on reducing a matrix field theory to a matrix model with a finite number of degrees of 
freedom, but in the large- A^ limit, has been a recurring theme ever since. 

Important breakthroughs in the study of random surfaces, two-dimensional string 
theory, and two-dimensional quantum gravity coupled to matter, were made after the 



Chapter^ Introduction 



13 



mid 1980s (see Ref. |42 for a review). The planar Feynman-graph expansion of large- 
matrix models was used as a way of discretizing a two-dimensional surface. Models with 
one or a finite number of matrices, and the c = 1 quantum mechanics of a single matrix, 
were of importance in these developments. The double scaling limit was developed to 
study surfaces obtained in the continuum limit. In the double scaling limit, the coupling 
constants are tuned to critical values as — > cxd. This limit is not a classical limit, 
unlike the 't Hooft large- limit. Fluctuations in observables remain large in the double 
scaling limit. However, the double scaling limit allows one to include contributions from 
all genera in the topological expansion of Feynman diagrams. 

The work of Seiberg and Witten in the early 1990s on electric-magnetic duality 
allowed the elucidation of vacuum structure of a large class of supersymmetric gauge 
theories along with the mechanisms of chiral symmetry-breaking and confinement in 
these cases 

In the mid 1990s, supersymmetric matrix models were proposed as non-perturbative 
definitions of M-theory and superstring theory [13| I59j. From the late 1990s onwards, 
there has been a great deal of work on the large- limit of supersymmetric gauge theories, 
catalyzed by the AdS/CFT correspondence of Maldacena j75j . Large- A^ matrix models 
are also used to study the effective super potentials for AT = 1 supersymmetric gauge 
theories with adjoint chiral super fields, following the work of Dijkgraaf and Vafa |281 
122] • Matrix models also find applications to the problem of determining the anomalous 
dimensions of operators in = 4 supersymmetric Yang-Mills theory |80| Ej . 

Random-matrix theory has also been applied to the spectrum of the QCD Dirac 
operator (see especially the work of Verbaarshot et. al. 101 ). 
Random matrices in other areas of Physics and Mathematics 

Remarkably, random matrices have found applications in many areas of physics and 
mathematics outside of particle and high-energy physics. We list a few of them. 

Random matrix theory originally arose from the suggestions of Wigner and Dyson 
in the 1950s, that the statistical properties of the spectra of complicated nuclei could be 
modelled by a random hamiltonian |77j . 

Spin systems on random two-dimensional lattices have been studied using the large- 



Chapter^ Introduction 



14 



limit of matrix models. For example, Kazakov studied the Ising model on a random 
two-dimensional lattice with fixed coordination number j65j . 

Random matrices also have deep connections to statistical properties of zeros of the 
Riemann zeta function [77j . Montgomery and Dyson discovered that the pair correlation 
of scaled zeros of the Riemann zeta function is asymptotic to that of eigenvalues of a 
large random unitary matrix [HJ . More recently, the universal part of the moments of 
the zeta function on the half line have also been found to be related to those of the 
characteristic polynomial of a large random unitary matrix j67j . 

Some chaotic quantum systems have been modelled by universal properties of large- 

matrix models, as have the universal correlations in some mesoscopic and disordered 
systems [171 1^ . 

The work of mathematicians, including Voiculescu and collaborators, on von Neu- 
mann algebras led to the development of the field of non-commutative probability the- 
ory inn]- The correlations of large- matrix models give natural examples of non- 
commutative probability distributions. 

1.2 QCD, Wilson Loop, Gluon Correlations 

Classical Chromodynamics 

Classical Chromodynamics (the ?i ^ limit of QCD) in 3 -|- 1 space-time dimensions 
is a non-abelian gauge theory with structure group SU{N), where the number of colors 
is = 3 in nature. The matter fields q^ix) are quarks, spin ^ fermions transforming 
under A'^^ copies of the fundamental representation of SU{N). 'a' is a flavor index and 
'a, V are color indices. Quarks come in Nj flavors, where Nj = 6 (up, down, strange, 
charm, bottom and top in order of increasing "current" quark mass; though quarks 
have not been isolated and "weighed" , one can define their mass using their interactions 
with electroweak currents). At low energies, only up and down are important for the 
proton and neutron. For the most part, we will ignore flavor dependence in this thesis. 
The bosonic gauge (gluon) fields [A^(x)]{J', /x = 0,1,2,3 are four N x N hermitian 
matrix- valued fields. They are the components of a connection one- form on Minkowski 
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space-time R^'^, valued in the Lie algebra of SU{N). Let A = denote the space 

of connections. The theory is defined by the action 

N r r 
^0 = -T^ / d^x tr F'^-F^, + / if^[-ij^[D^t " mX]qabd''x (1.1) 

•y^ are the Dirac gamma matrices. Here the Yang-Mills field strength is the anti-hermitian 
matrix field F^jy{x) = d^A^ — dyA^ + [A^, Ay\. The covariant derivative is = (15^ — 
iA^) where 1 is the N x N unit matrix. The coupling constant g is dimensionless in 
3+1 dimensions. The classical solutions are field configurations (^^(x), q{x),q{x)) 
that extremize the action, the solutions of the partial differential equations ^D^F^^'^ = 



^{d^F^"" - i[A^, F^"']) = f. Here [j"]^ = q'^^Qb is the quark current. 



Notice that the action 5*0 is Lorentz invariant. However, not all observables need to 
be Lorentz invariant. Lorentz transformations (which are isometrics of Minkowski space 
preserving the metric ds^ = {dx^)"^ — {dx^)"^ + (dx^)^ + (dx^)^) merely relate observables 
in different reference frames. 

Now, the action is also invariant under the group of local gauge transformations 
G = {U{x)} 

A^{x) ^ UA^U-^ + Ud^U-^; q{x) ^ U{x)q{x) (1.2) 

where U{x) is a map from space-time to the structure group SU {N) that tends to the 
identity at infinity. The principle of gauge invariance states that not just the action, 
but every observable of the theory must be invariant under gauge transformations! The 
most famous gauge-invariant observable is the parallel transport along a closed curve 
7 : [0, 1] R^'^; 7(0) = 7(1), the so-called Wilson loop observable 



tr 



^(7) = -Pexpj -i jj^ A^{^{s)) r{s) ds^ (1.3) 

where Pexp stands for the path ordered exponential. One can also consider an open 
string or meson observable with quarks at the end points (here 7 is not a closed curve) 



M{l) = ^g-(7(0))" 



Pexpj -i^^ A^(7(s)) Y{s) ds 



''?(7(l))fe- (1-4) 



We shall give the physical interpretation of the Wilson loop when we discuss its expec- 
tation value in the quantum theory. 
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Quantum Chromodynamics 

So far, we have been discussing Classical Chromodynamics. In the path integral ap- 
proach to quantization, q{x), q{x)) become random variables. Quantum Chro- 
modynamics (QCD) is the assignment of expectation values to gauge-invariant functions 
of these random variables. Naively, they are obtained by averaging over the quark and 
gluon fields with a weight given by e*"^"/^. However, on account of the gauge invariance 
of the action, this functional integral is ill-defined. Rather than average over the entire 
space of connections A, we should be averaging over the space of connections modulo 
gauge transformations A/G with the measure induced by the Lebesgue measure on A. 
A/Q parameterizes the true degrees of freedom according to the gauge principle. The 
idea is to choose a representative for each orbit oi Q va. A (gauge fixing) and integrate 
over the coset representatives. In effect, the gauge field has only two independent com- 
ponents (for example, the transverse polarization states) after taking into account the 
relations imposed by gauge transformations. However, in order to maintain manifest 
Lorentz covariance, it is sometimes more convenient to retain all four components of the 
gauge field, and introduce Fadeev-Popov ghost fields (a pair of grassmann-valued hermi- 
tian matrix fields, c^(a;), c^(rc)) that act as negative degrees of freedom. The standard 
implementation of this idea jHJ iii the so-called covariant gauges leads to the gauge fixed 
action 



The two additional terms in the action come respectively from the gauge fixing and the 
Fadeev-Popov determinant, which is the jacobian determinant for the induced measure. 
The expectation values of gauge-invariant observables 0{A^q,q) are independent of the 
gauge parameter ^ and are given by 



S{A,q,q,c, c 





{0{A,q,q)) 




[,q,q,c,c)e'^/'' 0{A,q,q) 
J d{A, q, q, c, c)e^^/'^ 




For example, if O is the Wilson loop observable, then 



oo 



—I 



(7(si),--- 





n! 



^J.l■■■^ln 



n=0 
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where the gluon correlation tensors are 

G^^...f,„{xi,--- ,Xn) = {^A^^{xi)---A^„{xn)) (1.8) 

Though the gluon correlation tensors are not gauge- invariant, once they have been deter- 
mined in any specific gauge, they may be used to compute the expectation values of other 
gauge-invariant quantities such as the Wilson loop. Since the Wilson loop W{'y) and me- 
son observables M{'y) are gauge invariant, their expectation values can be calculated in 
any convenient gauge. 

The gauge fixed action (|1.5I) . is no longer invariant under local gauge transforma- 
tions, but rather under global unitary transformations q{x) Uq{x), q{x) q{x)U^ , 
A^{x) 1-^ UA^{x)W, c{x) 1-^ C/c(x)C/'l", c(x) i— > Uc{x)U'^. In other words, we have a 
matrix field theory (of both bosonic {A^) and fermionic (c, c) adjoint fields) coupled to 
a vector model. As it stands, this definition leads to ultra-violet divergent expectation 
values in 3 -|- 1 dimensions, and we have to supplement this with rules for regular- 
ization and renormalization. In particular, the coupling "constant" is replaced by a 
"running coupling constant" (7^(Q^), which depends on the momentum scale Q. For 
large Q^, we have the perturbative result that the coupling vanishes logarithmically: 

~ l/log((5^/AQ^£)). Renormalization introduces the dimensional parameter Aqcd 
which sets the scale for the strong interactions and is to be determined experimentally 
{Aqcd ~ 200 MeV). 

In this thesis, we study a pair of finite truncations of QCD. In Part I of this thesis we 
will study the vector model alone, though in two dimensions and in the null gauge where 
there are no ghosts, gluons can be eliminated and there are no ultra-violet divergences. 
In Part II we will study hermitian bosonic matrix models, which are matrix field theories 
where space-time has been regularized to have only a finite number of points. 
Wilson's Area Law Conjecture: We conclude this section with Wilson's area law 
conjecture on the asymptotic expectation value of Wilson loop observables: 

(VF(7)) ~ e""'^''(T) as Ar(7) ^ oo (1.9) 

where Ar(7) is the minimal area of a surface whose boundary is 7. This is a statement 
about the vacuum of the pure gauge theory with no dynamical quarks. However, we 
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can heuristically interpret the conjecture as an asymptotically linear potential between 
quark sources. To see this, consider a current which is concentrated on the curve 

7^(s). It is regarded as the current density of a quark-antiquark pair that are produced 
and annihilated and whose combined trajectory is the closed curve 7. Then 

^ A^(7) r{s) ds = J A^{x)f{x)d''x (1.10) 

and {W{'y)) is the probability amplitude for this process. In particular, consider a 
rectangular loop in the x^ — x^ plane with length L and time T. Suppose further, that the 
potential energy between quarks is asymptotically linear, E ~ a'L. a' is called the string 
tension since the linear potential corresponds to quarks being held together by a string 
with constant tension. Then the amplitude for this process is e~^'^ ~ e~°^'^^ = e~"'"^'"('>'). 
Thus, the Wilson area law conjecture is a criterion for confinement of non-dynamical 
quarks by an asymptotically linear potential. 

1.3 (i ^ 00 as a Classical Limit in Atomic Physics 

To motivate the idea of the large- limit as a classical limit of matrix models and gauge 
theories, let us first describe a much simpler but analogous idea in the more familiar 
area of atomic physics: the problem of determining the ground-state energy and wave 
function of electrons in an atom. 

The ?i — > classical limit is not a good approximation, since the atom is not stable 
in this limit. The Coulomb potential is not bounded below and the electrons would fall 
into the nucleus. Of course, in the quantum theory, this is prevented by the uncertainty 
principle: momentum would grow without bound if we tried to concentrate the electron 
wave function at the minimum of the Coulomb potential. However, there is another 
classical limit, where the number of spatial dimensions d 00, in which the atom has 
a stable ground-state! This classical limit can even be used as the starting point for an 
approximation method for d = 3. 

We illustrate this idea for a hydrogenic atom of atomic number Z. The analogue of 
unitary invariance in matrix models is rotational invariance for the atom. So we will 
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consider the problem of minimizing the energy 



E 



^2 dip* dtp Ze^ 



|^(x)p 



(1.11) 



2m dxi dxi 

subject to the unit norm constraint / |^(x)|2(i'^x = 1, in the zero angular momentum 
sector V(x) = ipir) where r = (S^^-^XjXj) 2 . Note that we work in d spatial dimensions 
but retain the three-dimensional Coulomb potential. While fluctuations in all observ- 
ables vanish in the ?i — > classical limit, only the fluctuations in rotationally invariant 
observables vanish in the d ^ 00 classical limit that we study below. 

We first transform from Xi r, which introduces a jacobian d'^x = ^^f'^^^dr. is 
the surface area of the unit sphere 5"^"^. Thus 



dr dr , 3 . 



(1.12) 



2m dxi dxi 

and < >= J ip*{r)ip{r)Q.^r'^~^dr. We absorb the jacobian by defining a radial wave 
function ^{r) = y/rLdr^'^^^^^'^'4){r) so that it is normalized in a simple way < ^'|^' >= 
/ ^*{r)'^{r)dr. In terms of the radial wave function: 



E 



Zm 
2m 



+ 



[I 



Ze" 



^'(r)|2 + 



n\d-l)[d-Z) Ze^ 



^(r) 



dr 



dr 

(1.13) 



where we have integrated by parts and ignored the surface term which vanishes for square 
integrable wave functions. We see that a portion of the kinetic energy coming from 
derivatives of the jacobian and integration by parts now manifests itself as a correction 
to the three-dimensional Coulomb potential. The hamiltonian is 



fi' d^ 



2m dr^ ~'~ 8m 



fi^ (d- l)(d-3) Ze^ 



(1.14) 



To take the d ^ 00 limit, we must re-scale to variables that have a finite limit: p 



vr 



-i A- H = —■ a 



H 



d 



The hamiltonian and commutation relation are 



2^ fi (d— l)(d — 3) Za _ i 
2m Smp^ d^ p ' ' d 



(1.15) 



We see that the large d limit holding % finite is a classical limit, vr and p commute 
and fiuctuations in the rotationally invariant variable p are small. In this limit, we 
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get a classical mechanical system whose phase space is the half plane {p > 0, vr} with 
hamiltonian and Poisson bracket 

The main difference between this classical limit and the usual /i ^ limit is the appear- 
ance of a centrifugal barrier to the Coulomb potential, even for zero angular momentum! 
Thus we have a "classical" explanation for the stability of the atom in the d ^ oo limit. 

The ground-state is given by the static solution vr = 0, p = po = ^^mZa ' which is the 
minimum oiVeff{p) = ~ other words, in the d — > oo classical limit, the 

electron wave function is concentrated at a radial distance of po from the nucleus. To 
recover fluctuations in the electron position we must quantize this classical theory. The 
ground-state energy is = 14// (po) = — . To quantitatively compare with the 

known answer in 3 dimensions we revert to the old variables E = dE and e = a d^ and 
set d = 3 

^exact — TT To 

2 h 

_ _2mZV 

^approx — g I ) 

Alternatively, we could have compared the energy per dimension. Thus we have a crude 
first approximation to the ground-state energy of hydrogenic atoms. Similarly, po pi'o- 
vides a first approximation to the mean distance of the electron from the nucleus. 

This idea has been extended to many electron atoms as well. One way of imple- 
menting the Pauli exclusion principle for many electron atoms is to let the number of 
spin states of the electron tend to infinity along with the number of dimensions. Then 
the wave function is taken to be totally anti-symmetric in spin quantum numbers. For 
example, D. Herschbach and collaborators jSIl have calculated atomic energy levels in a 
2 expansion. The leading term is less accurate than other methods such as Hartree-Fock 
theory. However, its advantage is that the complexity does not grow as fast with the 
number of electrons, since the problem can be reduced to the algebraic minimization of 
an effective potential. Even more impressive is the spectacular accuracy they achieved 
by including the corrections in an asymptotic series in inverse powers of d. By resum- 
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ming this series, they obtained an accuracy of more than 9 significant figures for the 
ground-state of the Hehum atom . 

Thus we see that the d — > oo classical limit allows us to understand certain features 
of the theory that the h ^ classical limit misses. It also serves as a starting point for 
an approximation method. We expect the N ^ oo classical limit to play a similar role 
in matrix models and gauge theories. 

The system we have just studied can also be considered as a non-relativistic 0(d)- 
vector model in one dimension (time), where the position Xi{t),i = l,---,d is a d- 
component vector. The restriction to zero angular momentum corresponds to requiring 
all observables to be 0(d) invariant. More generally, the large-A^ limit of 0(A^)-vector 
models are of much interest especially in statistical physics. denotes the number of 
spin projections for example in a Heisenberg ferromagnet. The large-A^ limit was first 
studied in the context of the spherical model by Stanley j96j . The 0(N) non- linear sigma 
model in three spatial dimensions describes phase transitions in three dimensions 

1.4 The Large- A/^ limit: Planar Diagrams and Factorization 

The large- limit was introduced into the study of gauge theories and matrix models 
by 't Hooft, who showed that in the large- limit, with appropriately scaled coupling 
constants, the planar Feynman diagrams (or those that can be drawn on a sphere) domi- 
nate Indeed, the usual perturbative sum over Feynman diagrams can be reorganized 
according to the genus of the Riemann surface on which the diagram can be drawn. 

Let us indicate how this works for a matrix field theory. The dynamical variable is 
an N X N hermitian matrix- valued scalar field A^(x), where a, b are "color" indices. The 
partition function is 



(1.18) 



/N f ( 1 1 
dAexp d'^xl - tr d^A(x)d'' A(x) + - tr A'^(x) + ^ 5fc tr A 

We have kept an over all factor of multiplying the action. The limit fi ^ holding A^ 
fixed is the usual classical limit. Letting N ^ oo holding h fixed is a different classical 
limit. They are not the same because the matrix A(x) depends on A^, though not on 
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h. Let us concentrate on the factors of that appear in a Feynman diagram due to 
the matrix-valued nature of the field. We will suppress factors of h and all space-time 
dependence, symmetry factors etc. It is convenient to use a double line notation, where 
propagators of are denoted by oppositely directed double lines, each carrying one of 
the two color indices. If we also had vectors (like quarks q""), we would denote them 
by single lines carrying the single color index. Consider a connected Feynman graph in 
the perturbative expansion of the logarithm of the partition function (connected vacuum 
diagrams, no external legs). Suppose it has L loops and E propagators (edges). Vertices 
are due to the cubic and higher interactions. Suppose there are Vk vertices of coordination 
number A; > 3, then the total number of vertices is y = X)fc>3 ^fc- 

Now let us think of each loop as the boundary of a face, its orientation is given by 
the direction in which the color index flows on the loop which is its boundary. The 
propagators that make up the boundaries are edges. Since we are considering connected 
diagrams with no external legs, we get an oriented polyhedral surface. In other words, 
a regularization of a Riemann surface. The number of edges, faces and loops are related 
to the number of handles (= genus G) by the formula for the Euler characteristic 

V-E + L = x = 2-2G (1.19) 

Each loop involves a sum over colors and contributes a factor of N. (In the double line 
notation, a loop involves an inner line which forms a closed curve with colors summed 
and an outer line which is not a closed curve) Each propagator (inverse of the quadratic 
term in the action) gives a factor of 1/A^. The Vk fc-valent vertices contribute a factor 
of QkN each. Thus the factors of gk and N associated with any Feynman diagram is 

N^{l/Nf HigkNf^ = Ar^-^ArSfe>3^'= J] g^" = n^-^^ \{ gl" (1.20) 

A;>3 A;>3 fc>3 

Thus diagrams with a common power of N have a fixed genus. Moreover, in the limit 
N ^ CO holding gk fixed, the leading diagrams are those that can be drawn on a sphere 
(genus G = 0, known as planar diagrams). Suppressed by l/N'^, are diagrams that can 
be drawn on a torus (G = 1, sphere with one handle) and so forth. Moreover, we see 
that the logarithm of the partition function is proportional to N"^, so that we should 
define the large- limit of the free energy as F = — limjv->oo ^ogZ. 
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Suppose we had vectors (quarks) in addition to the matrices. It turns out that 
diagrams with P internal quark loops are suppressed by (1/A^)'^ compared to the planar 
diagrams involving the matrix fields (gluons) alone. This is because a quark loop, being 
a single line loop, does not involve a sum over colors. Diagrams with P internal quark 
loops correspond to Riemann surfaces with P punctures. 

Large- A?^ Factorization: These selection rules regarding the dominant diagrams as 
N ^ oo continue to hold even when we consider diagrams with external legs, (i.e. 
expectation values of U{N) invariants). However, there is a further simplification beyond 
planarity, when we consider expectation values of products of U (N) invariants: they 
factorize. Consider a hermitian multi-matrix model. The matrices are Ai, i = 1, ■ ■ ■ M 
where the z's can be thought of as labelling points in space time. The action is a 
polynomial S{A) = S^^""^"Ai^ ■ ■ ■ Ai^ and correlations are given by 

Z = j dAe-"" ; (/(A)) = 11 dAe-"" ^(^) f{A) (1.21) 

Then the expectation values of U{N) invariants factorize. For example, let $ii...i„ = 
%Ai,---Ai„. Then 

{'^n■■■^n^n■■■,J = i'^n-iJi'^n-jJ + (1-22) 

Factorization can be proven perturbatively, (i.e. in powers of the cubic and higher order 
coupling constants S"*^*^'"*" for n > 3). Let us give an example of factorization in the 
very simplest case of the gaussian S{A) = ^ J2i • Then the basic two-point correlation 
is 

mt [AjTci) = j^S,, S2 51 (1.23) 

Wick's theorem says that any correlation is a sum over all pairwise contractions. For 
example (5^ = A^), 

{^^J^ki) = {l^[Ai]l[A,t l^[A,UAi]i) 

^ ^ X xo. ^ ^ X xcxd I ^ ^ X xcixc ^ ^ X xdxb 
I ^ ^ X xo- X<i ^ ^ X xc xb 



Sij6ki+ ■^Sikdji+ ■^6ii6jk = {'^ij){^ki)+0{^) (1.24) 
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Of the three terms on the right, the first corresponds to a planar diagram, the second 
is non-planar (and so suppressed by and the third is planar, but suppressed by 
because it involves a contraction between matrices in two different traces. 

The factorization of U{N) invariant observables in the large-A^ limit implies that 
they do not have any fluctuations. Thus the large- limit is a classical limit for these 
variables. Factorization also holds for the invariant observables of a vector model and 
also for the meson observables ()1.4|) of a model with both vector and matrix- valued fields. 
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Baryon in the Large- A?^ Limit of 
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Baryon in the Large- Limit of 2d QCD 

In two dimensions the gluon field has two polarization states. They can be taken as 
the null and time components. The null component can be set equal to zero by a choice 
of gauge. The time component is not a propagating degree of freedom. It is eliminated 
by solving its equation of motion. This leads to a linear potential between quarks. Thus, 
2d QCD with colors is a relativistic vector model of interacting fermions. By summing 
the planar diagrams of the large- limit, 't Hooft obtained a linear integral equation 
for meson masses and wave functions j53j . However, it was not clear how baryons arose. 
Witten proposed that baryons be described by a Hartree-type of approximation in the 
large- limit, though in a non-relativistic context |in4j . 

In the null gauge, when quarks are null separated, the parallel transport operator in 
the meson variable H1.4() is the identity. Thus, two-dimensional QCD can be formulated 
as a bilocal theory of quark bilinears. Rajeev j^, found a bosonization in terms of bilocal 
meson variables that satisfy a quadratic constraint. The latter is the projection operator 
constraint on the quark density matrix. We review the derivation of this theory from 2d 
QCD and its classical large- limit in Chapter [2j The phase space of this theory is an 
infinite grassmannian, a disconnected manifold with connected components labelled by 
an integer, the baryon number. In Chapter|3J we study the baryon, which is a topological 
soliton, the minimum of energy on the component of the phase space with baryon number 
equal to one. The ground-state form factor of the baryon is determined variationally by 
restricting the dynamics to finite-rank submanifolds of the phase space. This leads to a 
derivation of an interacting parton model. In the simplest case, this interacting parton 
model corresponds to a Hartree-type approximation to an A^-boson system in the large- A^ 
limit. The A^ bosons are the quasi-particles corresponding to valence-quarks whose wave 
function is already totally antisymmetric in color. In the large- A^ and chiral limits, the 
exact ground-state of the baryon occurs on a rank-one submanifold of the phase space, 
corresponding to a configuration containing only valence-quarks. We use the valence- 
quark distribution to model the proton structure function measured in Deep Inelastic 
Scattering at low momentum transfers, where transverse momenta can be ignored as a 
first approximation. 
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Chapter 2 

From QCD to Hadrondynamics in 
Two Dimensions 

In this chapter we review Rajeev's reformulation |87| I86j of 1 + 1 dimensional QCD 
in the null gauge in terms of color-singlet quark bilinears: two-dimensional Quantum 
Hadrondynamics (QHD). The large- limit (where N is the number of colors) of the 
color-singlet sector of two-dimensional QCD is a classical limit of QHD and is a nonlinear 
field theory of a constrained bilocal meson variable valid at all energies. The classical 
phase space is a curved manifold, the infinite dimensional grassmannian whose connected 
components are labelled by baryon number. The Poisson algebra of observables is a 
central extension of the infinite dimensional unitary Lie algebra. Hamilton's equations 
of motion are nonlinear integral equations for the meson field. Free mesons are small 
fluctuations around the vacuum, 't Hooft's linear integral equation for meson masses 
arises as the linear approximation. Even in the classical large- limit, Hadrondynamics 
contains the interactions of mesons. Moreover, baryons arise as topological solitons. 

2.1 Large-iV Limit of 2d QCD 

Two-dimensional QCD is a non-abelian gauge theory of quarks coupled to gluons. The 
quarks are two-dimensional Dirac spinors transforming as vectors in the fundamental 
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representation of the structure group SU{N). The gluons are vector bosons, one- forms 
valued in the Lie algebra of SU{N): traceless N x N hermitian matrices. The number 
of colors is 3 in nature. The action for a single flavor of quarks is 



-—^ J tr F^'^F,.d'x + 



(fx (2.1) 



where F^i, = d^A^, — d^A^ + [^4^, A^]. Here a, h are color indices, g is a coupling constant 
with the dimensions of mass and m is the current quark mass. 2d QCD is a finite quantum 
field theory, there are no ultra-violet divergences. Apparent infra-red divergences occur 
when m is set to zero, but these can be avoided by considering the m = case as a 
limiting case of massive current quarks. 

It is convenient to work in null coordinates t = — x"^, x = in terms of which the 
metric is ds^ = dt{dt + 2dx). dt is time-like and dx is a null vector and the initial values of 
fields are given on the null line i = 0. x is regarded as space and t is regarded as time. The 
components of momentum Edt +pdx are energy E = po and null momentum p = po+pi- 
The mass shell condition = Pq — Pi becomes m? = p(2E — p). So the energy of a 
free particle is E = ^{p + ^). We see an important advantage of null coordinates, 
energy and null momentum have the same sign. So quarks have positive null momentum 
while anti-quarks have negative momentum. Under a Lorentz transformation of rapidity 
( tanh^ = v/c where v is the boost velocity), t e^t, x — > — (sinh0)t -|- e~^x and 
p — > e^p, E — > e~^E + (sinh0)p. Thus, a scaling of null momentum is just a Lorentz 
boost. 

The time and null components of the gauge field A = Atdt + A^dx are At = Aq and 
Ax = Aq + Ai. We work in the null gauge A^ = 0. The reason to use the null gauge is 



7? 



that the parallel transport along a null line is the identity. So meson observables (|1.4() 

/ 

simplify. The two-dimensional quark spinor is g = 4= 

V 2 

coordinates satisfy (7*)^ = 0, (7^)^ = —1, {7^,7^}+ = 2 and a convenient representation 



. The Dirac matrices in null 



IS 




0-1 _ 1 

10/ I 1 



(2.2) 
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where q = q^C where the charge conjugation matrix satisfies Cj^C ^ = (7^)"^- The 
action becomes 



/N » 1 Tn 

dtdx ^2 {d^^tf + x\E- At)x + -{flHri - X^PX) - -jiv^'x + X^v) 



(2.3) 



2g' 

where E = —idt, p = —idx- X X^ therefore canonically conjugate and satisfy 
canonical anticommutation relations 

[xt»(x), Xb{y)U = - VK , [Xa{x),Xb{y)]+ = o = [x^"(x), x^^y)]+ (2.4) 

Neither rj nor At has time derivatives, so they are not dynamical and may be eliminated 
by solving their equations of motion, 

pn = mx, -9^A^fe(x) = — : x''Hx)xb{x) : 
V{x) = jx{x), A'i^ix) = -^Jdy: x^''{y)Xb{y) ■ - y\ (2.5) 

since d'^^\x — y\ = 5{x — y). So eliminating the longitudinal gluons leads to a linear 
potential between quarks. Normal ordering is with respect to the Dirac vacuum 

X^'^{p)\^ >= for p < 0, Xb(p)|0 >= for p > (2.6) 

where x{p) = J x{x)e~^^^dx. The resulting hamiltonian is 

H = J dxx^^^iP + y )Xa -£^J- X^"ix)Xb{x) :: X^'(y)Xa(y) : \\x - y\dxdy (2.7) 

Introduce the color-singlet meson operator (In ()1.4|1 the parallel transport between x and 
y is the identity in the null gauge) 

M(x,y) = -^:X^"(x)Xa(y): (2.8) 

Since the meson observable is gauge- invariant, we may calculate it in the null-gauge 
which is convenient for us. 

The hamiltonian and momentum can be written in terms of M(x, y) after rearranging 
the operators in the normal ordered products (Note: [dp] = ^) 

H 1 /■ 1 u? ~ q'^N f - - \x — v\ 

N^~2j 2^P+^)^^P^P)\-'^P\ + M{x,y)M{y,xy-^dxdy 
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P _ 1 
N ~ ~2 



pM{p,p)[dp] (2.C 



where M{p,q) = ^ dxdye~'^^^^~'^'^^ M{x,y). = m? — is a finite renormalization 
of current quark mass, a kind of self energy coming from the rearrangement of currents 
in the potential energy term. The Lorentz invariant squared mass is P{2H — P). The 
commutation relations can be expressed as 

1 



[M{x,y),M{z,u)] ^ 



(2.10) 



6{y - z) (e(x, u) +M{x,u))- 5{x - u) (e(z, y) + M (z, y)) 

where e{x,y) is the kernel of the Hilbert transform 

e{x,y) = e{x-y)=V J sgn {p)e'P^''-y\dp]. (2.11) 

plays the role of ?i in a conventional quantum system and the large- limit holding 
= g^N fixed is a classical theory of the color-invariant dynamical variable M{x, y) 
with Poisson brackets —iN[ , ] ^ { , } 



{M{x,y),M{z,u)] 



5{y-z){e{x,u) + M{x,y))-5{x-u){e{z,y) + M{z,y)) 



(2.12) 



From these, one can derive Hamilton's classical equations of motion. What are the 
allowed values of M{x,y), i.e. the classical phase space? From the definition in terms 
of quark fields, the classical variable M is a hermitian operator with integral kernel 
M{x,y)* = M{y,x). The fact that quarks are fermions survives the bosonization and is 
encoded in a quadratic constraint on M{x,y). The large- limit of the quark density 
matrix p{x,y) = -^X^^{x)xi{y) must be a projection when restricted to color-singlet 
states: = p. After normal ordering, M{x^ y) = 5{x — y) — e{x, y) — 2p{x, y) satisfies the 
quadratic constraint (e + M)"^ = 1. Moreover, baryon number i? = — ^ tr M which must 
be an integer is a topologically conserved quantum number. Thus the phase space is a 
disconnected manifold with connected components labelled by baryon number. It is the 
infinite dimensional (restricted) grassmannian j84j . In the next section wc will quickly 
review this classical theory on the grassmannian, 2d Classical Hadrondynamics, which is 
equivalent to the color-singlet sector of the large- limit of two-dimensional QCD. The 
quantization of this classical theory, 2d Quantum Hadrondynamics has been shown by 
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Rajeev to be equivalent to the color-singlet sector of 2d QCD for all energies and 
values of N and h. We will see that this large- classical theory is very different from 
the ^ ^ limit, Classical Chromodynamics. 

2.2 Classical Hadron Theory on the Grassmannian 

The color-singlet sector of two-dimensional QCD is an interacting theory of Dirac fermions 
(quarks) which come in N colors, with the constraint that only states that are SU{N) 
invariant are permitted. In the null coordinates we are using, the sign of kinetic energy 
and null momentum are the same. So consider the direct sum decomposition of the 
one particle complex Hilbert space TL = L'^{R) = H- © TI+ into positive and negative 
momentum functions 7i+ = {ip\ip{p) = for p < 0} and 7i- = {iplipip) = for p > 0}. 
Here iplx) = 'ip{p)e^'^^[dp]. The operator e introduced earlier is the sign of momen- 
tum, eip{p) = ^ip{p) for tp G 7{±. Thus e"^ = e and = 1. In momentum space we 
can regard e as a large diagonal matrix with eigenvalues —1 for the negative momentum 
states and eigenvalues -|-1 for positive momentum states. Thus, e represents the Dirac 
vacuum where negative momentum states are completely occupied and positive ones 
empty. The novelty is that, since quarks come in N colors, each completely occupied 
state is occupied by a color-invariant combination of — > oo quarks. 

More generally, for any orthogonal decomposition 7i = L_ ©L+, there is a hermitian 
operator $ which is —1 on elements of L_ and -|-1 on those of -L+. As before, <I>^ = 1. <I> 
represents a more general state than the Dirac vacuum, it has eigenvalue ±1 on states 
that are completely empty or filled. 

Loosely, the phase space of large- iV QCD, the restricted grassmannian ofTC, is the set 
of all its subspaces that do not differ too much from TC^. By the above correspondence, 
it is also the set of hermitian idempotent operators <I> which do not deviate too much 
from the vacuum e. 

More precisely, since observables are measured by their deviation from their values in 
the Dirac vacuum, we work with the hermitian operator M = ^ — e. Then the restricted 
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grassmannian is the infinite dimensional manifold 

Gr{n, e) = {M^ = M, (e + M)^ = 1, tr M^M < 00} (2.13) 

Tliis is the phase space of Classical Hadrondynamics. We recognize that M is just the 
large-A^ limit of the meson operator M{x,y) = — : x^°'{x)Xa{y) ■ of the previous 
section. The Grassmannian describes only those states of the multi-particle Fock space 
of the quantum theory SJ^^g^gA^'H'. K'^"H+ that are wedge products of single particle 
states (up to a scalar multiple). These are the coherent states, ones with a good classical 
limit. The more general states of the Fock space which are linear combinations of such 
wedge products are not included. Thus the grassmannian is embedded in the projective 
space of the fermionic Fock space with a large co-kernel: the Plucker embedding j24|l7fjj. 
We will return to this when deriving the interacting quark model from a variational 
approximation to the classical theory on the grassmannian (see ^3.2.41 ^3.4.3|) . 

As we saw earlier, the grassmannian is a disconnected manifold, each connected 
component being labelled by the topologically invariant integer S = — ^ tr M which is 
baryon number. Though neither ^ nor e has a finite trace, the trace of the difference is 
well defined. 

The restricted grassmannian carries an action of the restricted unitary group U{TC,€). 
$ U^U"^ preserves the conditions = 1,^^ = ^, tr — e)^ < cxd if 

g G U{n, e) = {U\U^U = UU^ = 1, tr [e, U]^[e, U] < 00} (2.14) 

The action on M is 

M ^ UMU^ + UeU^ - e. (2.15) 

Moreover, any ^ may be diagonalized by a unitary U and brought to a standard form 
with eigenvalues ±1. However, points on disconnected components of the phase space 
cannot be connected by a unitary transformation since the latter preserve B, which is 
integer- valued. Thus, the action is transitive on each connected component. Indeed, 
the grassmannian is the homogeneous space U{Tl,e)/U{H^) x U{H^). We will use this 
transitive action of the unitary group to show how to include anti-quarks and 'sea' quarks 
in a simple way, starting from valence-quarks alone (see ii3.4.4p . 
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The Lie algebra of the restricted Unitary group is easily obtained by taking U = 
e" ~ 1 + u, we get 

U{n,e) = {u\u = tr [e,u]^[e,u] < 00} (2.16) 

The infinitesimal action on M is M [u, e + M]. This brings us to the Poisson structure 
on the phase space. The fermion bilinears M form a representation of a central extension 
of this restricted unitary Lie algebra, as we see from the Poisson brackets (|2.12p . This is, 
in fact, the natural Poisson bracket on the grassmannian: it is invariant under the action 
of the unitary group. This requirement can be used to obtain the Poisson brackets of 
non-linear functions of M as well: 

{f{M),M} = -i[f'iM),e + M] (2.17) 

The dynamics is specified by the hamiltonian we obtained from the large- limit of 2d 
QCD, which may be regarded as a real-valued quadratic function on the classical phase 
space 

E{M) =]im^ = fhp+!^)M{p,p)[dp] 
N^oo iV 2 J 2 p 

+ 9_ I M{x,y)Miy,xy-^^dxdy (2.18) 

Total momentum is 

P = P/N = -^J pM{p,p)[dp] (2.19) 
Time evolution is given by Hamilton's equations 

^ = {E{M),M} = -i[E\M),e + M] (2.20) 

where the integral kernel of E'{M) is (Note: J-V is a finite part integral, a rule for 
integrating the ^ singularity, see Appendix 1^ 

E'{M){p,q) = —2T:5{p-q){p+f—)-^J'V / -^M{p + r,q + r)[dr] (2.21) 
4 p 4 J 

Thus, the equations of motion are quadratically non- linear integral equations for M(p, q). 
A further complication comes from the quadratic constraint (e -|- M)^ = 1. Thus the 
large- iV limit of 2d QCD is a highly non-linear interacting theory of mesons. 
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Let us first consider static solutions, given by [E'{M),€ + M] = 0. The simplest 
static solutions are given by the extrema of the energy E'(M) = 0. The minimum of 
energy on each connected component of the phase space describes the ground-state of 
Classical Hadrondynamics for a given baryon number. 

On the baryon number zero component of the phase space, the obvious static solution 
is M = 0, the vacuum. Small oscillations around the vacuum in the B = sector describe 
an infinite number of non-interacting mesons. To see this, we linearize the equations of 
motion around AI = 0: 



2 dt 2 

M{p + s,q + s) sgn (q) - sgn {p)M{p + s,q + s) 



(2.22) 



where K{p) = ^(p+^) is the kinetic energy. Moreover, the constraint {e+M)'^ = 1 when 



linearized becomes 



^{Pi q) = so that one may take p > 0, q < 0. 



sgn {p) + sgn (g) 
The equation of motion after eliminating the constraint is 

2 = -^i^ip) - K{q)]M{p,q) + ^9 J ^2 i^^l (^.23) 

Since the vacuum M = is translation invariant, the total momentum P = p — q is 
a conserved quantity. The remaining independent variable can be taken as momentum 
fraction ^ = p/P, < ^ < 1, and we look for small oscillations M{p,q,t) = e*'^*x(0- 
The resulting eigenvalue problem for uj is 

M\{0 = [^ + T^Ui) - -TV t Tp^dr^ (2.24) 



e (l-e)^"'"' vr Jo {i-r^f 
where Ai"^ = 2u)P — P^ is the square of the meson mass. This is the linear integral 
equation for the meson spectrum of 2d QCD in the large-A^ limit. It was originally 
obtained by 't Hooft by summing planar Feynman diagrams ^j. 

Thus, the 't Hooft linear integral equation for meson masses corresponds to the linear 
approximation around the vacuum, to our non-linear and non-local formulation of large- 
2d QCD. We do not know of any simple way of deriving this non-linear theory by 
summing planar diagrams. It is only in this linearized approximation that mesons are 
non- interacting in the large- limit. 
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Heuristically, we may regard free mesons as an infinitesimal departure from the vac- 
uum since they involve only the promotion of 0(1) quarks from the Dirac sea (where 
each fully occupied state contains 0{N) quarks) to a positive momentum state. Thus 
the large- limit is crucial. 

What then is a finite disturbance to the vacuum? It must involve the addition, 
removal or rearrangement of 0{N) quarks. The simplest example of such a state is a 
baryon. It is an old idea of Witten that baryons may be described in a Hartree Fock 
approximation in the large- limit. He outlined this in a non-relativistic context |in4j . In 
what follows we will develop the fully relativistic theory of the baryon in two dimensions 
and see that a variational approximation to this theory is indeed a kind of Hartree- Fock 
approximation to a many body theory. We turn to the study of the baryon now. 
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Chapter 3 

Ground-state of the Baryon 

The ground-state of the baryon in the large- hmit is the static solution corresponding 
to a minimum of energy on the baryon number B = 1 component of the phase space. 
Thus we have a realization of Skyrme's idea |93l 1941 1951 lllj that baryons must arise as 
solitons of a theory of mesons. The essential novelty is that our classical bilocal theory 
of the meson variable M{x,y) is not a low energy effective theory, but is equivalent to 
the large-A^ limit of 2d QCD for all energies. In particular, this theory has an infinite 
number of mesons, unlike many low energy effective theories which contain only a finite 
number of mesonic excitations. This chapter is based on our papers j7Ul 1631 IBl] . Other 
useful references are [SHI El EZ] • 

We first describe a numerical method for determining the minimum energy configu- 
ration, based on a geometric adaptation of steepest descent to the curved grassmannian 
phase space. 

Then we will present a variational approach. The primary hurdle in the minimization 
is the quadratic constraint satisfied by M{x,y). An ascending family of sub-manifolds 
of the grassmannian is found, corresponding to operators of increasing rank. They allow 
us to replace the constraint with simpler ones. The minimum on these sub-manifolds 
provides a variational approximation to the true minimum. The dynamics on these 
reduced phase spaces is shown to correspond to interacting models of quarks and anti- 
quarks. Thus we reconcile two seemingly disparate pictures of the baryon: the soliton 



Chaptei\^ Ground-state of the Baryon 



37 



and parton pictures. 

The minimum energy configuration M(p, q) is the form factor of the baryon. The 
diagonal elements M{p,p) are the quark and anti-quark probability densities in the 
baryon in the large- limit. We determine these by a variational approximation. As an 
application, we use these to model the quark distributions measured in deep inelastic 
scattering f ^3.5|) . 

Small oscillations around the ground-state of the baryon would correspond to excited 
states of the baryon, such as A and N* . We do not discuss these here. One could also 
study the minimum of energy on the sectors of higher baryon number; they correspond 
to two-dimensional nuclei. 

3.1 Steepest Descent on the Grassmannian 

To determine the ground-state of the baryon, we must minimize the energy H (12.181) on 
the baryon number one component of the phase space holding the momentum P 1)2. 19(1 
fixed. A Lorentz invariant formulation is to minimize the squared mass A^^(M) = 
P{2H — P) subject to the Pauli exclusion principle quadratic constraint (e + M)^ = 1 
and baryon number B = tv M = 1 constraint. The quadratic constraint complicates 
matters significantly since we are minimizing a non-linear function of M(x, y) on a non- 
linear manifold, the grassmannian. 

The first method we tried to solve this problem with, is steepest descent. The latter 
is a way of minimizing a real-valued function, usually on a linear space. It is an iterative 
technique: beginning with a conveniently chosen initial configuration, we move a small 
distance in a direction opposite to the gradient of the function. The procedure is repeated 
at the new point. After a sufficiently large number of iterations, we reach a minimum of 
the function. 

Now, the grassmannian is not a linear space, due to the quadratic constraint. It is 
good to keep in mind the sphere embedded in three dimensional euclidean space when 
thinking of the grassmannian. Indeed, the sphere is the simplest finite dimensional 
grassmannian. A tangent vector V to the grassmannian at the point ^ = e + M must 
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satisfy y]+ = 0. This comes from linearizing the constraint = 1 at the point 
$. We can think of the anticommutator as the generahzation of the dot product: the 
condition that a vector be tangential to the sphere is implemented by requiring that its 
dot product with the radius vector at that point vanish. Now, the gradient vector M'{M) 
is in general not tangential to the grassmannian. So if we move in a direction opposite 
to it, we will fly off the grassmannian! Actually, there is more bad news: if we move in 
a straight line in the space of all hermitian operators M, along the tangential projection 
T of M'{M), we will still leave the grassmannian. This is analogous to moving along a 
tangent vector to the sphere. Now T = where Y = |[<I>, A^'(<I>)]. To understand 

the formula for the tangential projection, think of the commutator as the analogue of 
the cross product. Taking the cross product of an arbitrary vector Ai' with the radius 
vector <I> produces a vector Y tangential to the sphere, but rotated by a right angle. 
We need to take a second cross product with the radius vector ^> to get the tangential 
projection T. 

A way to stay on the grassmannian is to move along a curve 7(t) = $(t) — e which 
starts out tangential to the tangential projection T of the gradient vector M' . The 
natural choice for a curve is the great circle, i.e. the geodesic. The great circle through a 
point ^ on the sphere tangential to a vector T is obtained by rotating the radius vector 
$ about an axis orthogonal to T. In our situation, Y is the necessary vector orthogonal 
to T. Moreover, the analogue of rotation is the action of the unitary group on the 
grassmannian (see ^2.2|) . So a rotation about the axis Y is implemented by the unitary 
operator e'^^ . In other words, the geodesic through $ tangential to T and parameterized 
by r is 

$(r) = e"^$e-"^ (3.1) 

We are finally ready to formulate the steepest descent algorithm for the grassmannian 
by taking small steps along the geodesic that goes in the direction of maximal reduction 
of squared mass. 

1. Start with an initial configuration Mi and a small r. 

2. At the k^^ configuration Mk evaluate the gradient M.'{Mk) and the rotation gen- 
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erator = ^[e + Mk,M'{Mk)]. 

3. Let Mfc+i = e"^^* [e + Mjt]e~'^^'= — e (M transforms inhomogeneously unlike $ H2.15() ) 

4. Repeat till there is no further reduction in the mass of the baryon. 

We implemented this steepest descent method numerically using Mathematica. The 
value of the time step r was arrived at empirically to ensure quick convergence. As 
for the initial configuration, we used the solution of the separable ansatz, a more ana- 
lytic variational approximation method which we turn to next. We do not present the 
numerical results here, some earlier numerical results are given in the paper j7Uj . 

3.2 Separable Ansatz: Formulation 
3.2.1 Rank-One Configurations 

M{p,q) depends on a pair of variables. In Feynman's valence parton model |17[ I3bj . the 
valence-quark distribution is a function of a single momentum variable. Since we know 
that this gives a reasonable description of the proton, it must be a good approximation to 
assume that M{p, q) is built out of a single function of one variable, i.e. it is a separable 
or rank-one operator: 

Ml = -2 ip (g) ^p'' i.e. Mi{p,q) = -2 ^{pW{q) (3.2) 

The quadratic constraint (e + M)^ = 1 becomes 

(e - 2-0 ■0''")^ = - 2eV' -0"^ - 2^/^ (g) '0"^e + A^p ® ip^ (3.3) 

The constraint is satisfied if ip{p) vanishes for p < i.e. eip = ip and is of unit norm ip^ip = 
1. The former means this configuration contains no anti-quarks. The latter ensures Mi 
is a configuration of baryon number one: B = — ^ tr Mi = 'ip{p)'4'* {p)[dp] = 1. 

Since Mi is unaltered by a change in the phase of ip, the phase space of the rank- 
one ansatz is the projective space on positive momentum wave functions V{TC+). The 
Poisson brackets of M(p, q) are satisfied if 

Wp),^)} = = {rip), rip')}, 
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Mp),rip')} = -i27T6{p-q) (3.4) 
The hamiltonian when written in terms of ip is 

°°lr . /^^i|7/ M2rJl I ^ ~2 f \ I / m2|;/ \|2l'^~y| 



^^1^ = ^[p+jUiprm + fg' J mxrimr^^dxdy. (3.5) 

Thus, we have a self contained hamiltonian dynamical system on the reduced phase space 
of rank-one configurations. 

3.2.2 Quantizing the Separable Ansatz: Interacting Valence Quarks 

In order to make precise the relation of the separable ansatz to an interacting valence- 
quark model, we quantize the classical theory of the previous section. We denote the 
parameter that measures quantum corrections by jj. The constraint on norm is imple- 
mented by restricting attention to states \ V > satisfying 



00 ^ * 



<V\ {p)ij{p)[dp]\V >= 1. (3.6) 
Jo 

The Poisson brackets go over to commutation relations 

[kp),kp')]=0 = [i'\p),i'\p% [kp),i\p')] = ^27r6ip-p'). (3.7) 

A representation for these commutation relations is given by the canonical commutation 
relations of bosonic creation-annihilation operators: 

[kp),kp')] = = [Hp), Hp')], Hp), Hp')] = 27r6{p - q). (3.8) 

where ib = — 1=6, = — 1=6^. In terms of these the constraint becomes 

<V\ b^p)b{p)[dp]\V >= N (3.9) 





Thus we have a system of bosons, so N must be an integer. The hamiltonian operator 
becomes 

NEi = \[p+^]h\p)h{p) [dp] + £^J 6t(x)6t(y)^^6(y)6(x) dxdy. (3.10) 
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The baryon must be in a simultaneous eigenstate of the hamiltonian and momentum 
operators, whose eigenvalues En and P are the energy and total momentum of the 
baryon. The momentum operator is 

roo 

P= p h\p)b{p) [dp]. (3.11) 

JO 

If we work in the momentum basis, an particle momentum eigenstate is 

|pi,P2, • • • ,P7V >= h\pi)b\p2) ■ ■ ■ h\pN)[Q > . (3.12) 

A general state [^p > containing N particles is given by the wave function ip{pi, ■ ■ ■ ,pn), 
where 

roo 

[ip >= [dpi]- ■ ■[dpN]i'{pi,- ■ ■ ,Pn)\pi,- ■ ■ ,PN > ■ (3.13) 
Jo 

The expectation value of the hamiltonian in such a state is 

<i;[NE[i;> = / [p, + t^]\i;{p,, . . . ,p^)\^[dpi] ■ ■ ■ [dpN] 

Jo f^^2 Pi 

2iVyo 2 

It is now clear that we have a system of N bosons interacting through a linear two-body 
potential in the null coordinates. What are these bosons? They are just the valence- 
quarks of the parton model, whose dependence on color has been factored out. Though 
quarks are fermions, the total baryon wave function must be totally antisymmetric in 
the color indices so that it is invariant under SU{N). Thus, the baryon wave function 
(suppressing spin and flavor dependence) ^' factorizes as 

§(ai,pi;a2,P2; • • ■■,aN,PN) = ea^^a2,-aM'^{Pl^P2, ■ ■ ■ ,Pn)- (3.15) 

Here denote color indices. Thus, if we ignore color, valence-quarks behave like bosons. 

In the parton model, the momentum fraction carried by a quark satisfies < = 
^ < 1. How do we recover this in our picture? Null momentum is positive for a quark. 
Thus, the momentum of the particles created by 6^(p) is always positive whence the total 
momentum P is a positive operator. On an particle momentum basis state, P is just 
the sum of individual momenta, each of which is positive: 

P\Pi,P2 ■ ■ ■ ,PN >=[pi+P2^ Pn]\pi,P2, ■ ■ ■ ,PN > ■ (3.16) 
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An eigenstate of P with eigenvalue P must satisfy 

bi H Pn] 4>{(^i,Pi; ■ ■ ■ OAT, pat) = P(t>{ai,pi; ■ ■■aN,PN)- (3.17) 

Since each of the momenta pi are positive, it follows that < ^ < 1. Thus, quantizing 
the rank-one approximation to our classical hadron theory gives us an interacting valence- 
quark model [701 inS] • 

3.2.3 Hartree Approximation and the Large-A^ limit 

It is an old idea of Witten jlU4j that for the baryon, the large- limit should correspond 
to a sort of Hartree approximation to a many body theory. We have a relativistic 
implementation of this idea in two dimensions. 

We would like to find approximate simultaneous eigenstates of Ei and P. The ground- 
state of a many boson system can often be described by mean-field theory: each boson 
moves in the field created by all the others. Moreover, all the bosons can be assumed 
to occupy the same single particle state in this ground-state. The naive choice is a wave 
function ip which is a product of single particle wave functions. But such a choice would 
not be a momentum eigenstate, so we enforce it by requiring the momenta to add up to 
P. 

'4>iPl,P2, - ■ ■ ,Pn) = 2TT6C^Pi - P)ijj{pi)ip{p2) ■ ■ ■ i^ipN)- (3.18) 

i 

In particular, the fraction of momentum carried by each parton is less than one tp{p) = 
unless < p/P < 1. In terms of the single parton wave function, the energy 1)3. 14(1 of 
the baryon per quark becomes {N{N — 1) N"^) 

EiW= ri[p + -mpndp] + ^ I |^(x)|V(y)P^^^^dy- (3.19) 

Jo 2, p 2 J 2 

Here, we are retaining the condition < p < P. The remaining correlations imposed by 
the factor SQ2iPi — P), are suppressed for large- A^. As a compromise to the momentum 
eigenstate condition we will require the expectation value of P to equal P. This is 
analogous to using the canonical ensemble as an approximation to the micro-canonical 
ensemble. This gives rise to the momentum sum rule: 

N / p\Tp{p)\'^[dp] = P. (3.20) 
Jo 
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The wave function also satisfies the normaUzation condition \'ip{p)\'^ [dp] = 1. 

We see that the Hartree approximation to the interacting valence-quark model has 
reproduced our classical hadron theory restricted to the separable submanifold of the 
phase space, except for the requirement that the wave function vanish beyond p = P. 
This subtle difference is a semi-classical (finite A^) effect. To recover the true classical 
limit, we must let N become truly infinite. Since the total momentum is an extensive 
variable, P — > oo as — > oo. So for N = oo, the valence-quark wave function is not 
required to vanish for any finite value of p. 

3.2.4 Pliicker Embedding and Density Matrix 

In order to find the Fock space state corresponding to the classical rank-one ansatz, 
we use an infinite dimensional version of the Pliicker embedding |241 I79| 18 7j of the 
grassmannian in the Fock space = X]5^s=o ^^W- "Xi A^H^. Here 7i = Ti- © W+ is the 
splitting into positive and negative momentum subspaces. 

Given a point M, the state in T to which it is mapped by the Plucker embedding, is 
the wedge product of the occupied states. These are the eigenstates of the density matrix 
p = ^(1 — M — e) with eigenvalue 1 or equivalently, the —1 eigenstates of <1> = e + M. In 
the case of the rank-one ansatz M = —2ip (>i) tp^ , we look for states x with 

= (e - 2^ = -X (3.21) 

Every negative momentum state (i.e. those with ex = — x) is a solution. The only other 
solution is X = since, eip = tp and tp^ip = 1. These are the filled states. Thus the Fock 
space state (denoted \Mi >) corresponding to the rank-one ansatz Mi is a semi-infinite 
wedge product: 

•••/'' A /I' A V (3.22) 

Here f^' is a dual basis in TC^. The only interesting information in this state is its 
departure from the vacuum. If we denote by |0 > the wedge product • • • A we see 
that |0 > is the filled negative energy sea. Then \Mi > is obtained by populating 
starting from the vacuum |0 >. 

|Mi >= bl\0 > (3.23) 
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Here, the quasi-particle created by blp is not a quark, but the colorless quasi-particle of 
the rank-one ansatz. This point of view is easier for computations and has the advantage 
of avoiding the unobservable color quantum number altogether. 

The quantized rank-one baryon can also be described in terms of the creation an- 
nihilation operators of colored quarks. Let \V > denote the second quantized state of 
the baryon. In the Hartree approximation, it is determined by the condition that the 
expectation value of the density operator in | y > must equal the classical density matrix 
of the rank-one ansatz: 

<V\Xp,q)\V>=pi{p,q) (3.24) 

For the rank-one baryon pi{p,q) = 'ip{p)'4^* (q) + ^{2Tr6{p — q)){l — sgn p). A short 
calculation shows that 

\V>=q'l..ql^\0>. (3.25) 
satisfies the above requirement. Here, q'^ creates a quark with color a in the state ■0. 



These operators satisfy canonical anti-commutation relations (CAR): {qiu,qp} = ^1 < 
u,v > with respect to the Dirac vacuum |0 >: q"^ |0 >= and g^^^lO >= where ^^-{p) 
vanishes for p > and '>p+{p) for p < 0. 

This formalism will be useful when we generalize the theory to include sea and anti- 
quarks (section 13.4.2(1 . Before doing so, we discuss the actual determination of the 
ground-state of the baryon within the rank-one approximation. 

3.3 Separable Ansatz: Solution 

The energy of the baryon in the separable ansatz is 

E/N = r\{p + ^mp)\''[dp\ + ^ r mx)\''\^ljiy)\^\^^dxdy (3.26) 



where = — Its momentum is 

P = P/N= / p\ij{p)\'^[dp] (3.27) 
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A Lorentz invariant formulation is to minimize the squared mass, which in null coordi- 
nates is M"^ = 2EP - P2 ( ^n\ : 



while holding li^ip)]' 



p\'iij{p)\'^[dp] 



m 



1 



\i^ip)\' 



5 vry Jo p 

oo 

+ / \i;{x)\-'\i;{y)\' 



-dxdy 



(3.28) 



1. The only parameter in the theory is the dimensionless 



ratio v = The total baryon momentum P can be scaled out by expressing momenta 
in terms of the ratio ^ = ^. It is convenient to introduce notation for the kinetic, self 
and potential energies: 



KE 



1 

P' 



\i^{p)Wdp\, 



PE 



SE 



1 



1, 7 



TT Jo P 



mp)ndp] 
1, 



dxdy\'ilj{x)\ \'il^{y)\ -\x -y 



Then, 



P*{vKE + SE + PE). 



(3.29) 



(3.30) 



3.3.1 Potential Energy in Momentum space 

While the kinetic and self energies are simple in momentum space, the potential energy is 
more easily expressed in position space. However, the integral equation for minimization 
of energy is simplest in momentum space. Moreover, the condition that ip{p) vanish for 
negative momenta is harder to implement in position space. It implies that ijj{x) is the 
boundary value on the real line of a function analytic in the upper half plane. So it is 
useful to rewrite the potential energy in momentum space. However, the kernel of the 
integral operator is singular, and we take some care to define it correctly. We write 



PE 



|V'(x)py(x)(ix, where V{x) 



\mr^^dy. 



The self-consistent potential V{x) obeys Poisson's equation V"{x) = \il}{x) 
a pair of boundary conditions. 



1/(0) 



mvT^dy, y'(0) 



\ /"OO 

2 



-y)? sgn {y)dy 



(3.31) 
^ along with 

(3.32) 
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If we define the momentum space wave function and potential by 

rP POO 

ip{x) = / ip{p)e'P''[dp] and V{r) = / e-''^'=V{x)dx (3.33) 

JO J-oo 

then the potential energy becomes 

PE = FV J [dp] J [dr]4j{p)ip*{p + r)V{r). (3.34) 

When the limits of integration are not indicated, they are from the lower to upper limit 
of the interval over which the integrand is supported. Poisson's equation in momentum 
space becomes 

Vip) = J^V^ f [dq]iPiq)riq-p) = J'V^Wip) (3.35) 
p J p 

We see that V{p) ~ ^ is singular for small p and use J-V to denote an appropriate 
"finite part". W{—p) = W*{p) is supported from —P to P. The rules for integrating 
this singularity are given by the boundary conditions to Poisson's equation: 

J'V J ^W{p)[dp] = J \i;{y)f\^dy and (3.36) 



p 

1 ~, 1 



oo 



iTV I -W{p)[dp] = ^ I {-my)\^ + m-y)\''}dy (3.37) 
J P ■i Jo 

We will primarily be interested in the ground-state of the baryon. Assuming that the 
ground-state wave function is real, W{p) is even. In Appendix 1X1 we show (for real wave 
functions ip{p) which vanish at the origin like p"',p > 0, a > 0) that Ty(0) = and 

(3.38) 

J-p p^ J-p p^ ttP 

Thus we can rewrite the potential energy as 



PE = -TV / ' V H = ^ + 2 / ' ' " n' ' " [dr] (3.39) 
' - ttP jn 



-p ■nP Jq 

Here W{0) = 1 and |H^(r)p = |H^(— r)p even if ■0(p) is not real. Notice that in the 
large-A^ limit, where P = oo, the term may be omitted. We can now derive the 
integral equation for the minimization of the baryon mass. 
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3.3.2 Integral Equation for Minimization of Baryon Mass 

The condition for the minimization of Ai'^ is equivalent to a non-hnear integral equation 
for the wave function ^{p). Recall that 

= P * {i^KE + SE + PE) . (3.40) 

We vary this with respect to ip*{p) and impose the constraint on the norm through a 
Lagrange multiplier A. The condition for an extremum is: 

P{u 6KE + 6SE + 6PE) + 6P{uKE + SE + PE) = \^{p). (3.41) 

Here 

5PE = -TV 6 r *^^r^* [dr] and ffi^ = i^(p + r). (3.42) 
J-p r2 67h*(p) 



Thus, 



5PE = -TVr''[dr]til±I^-TVr ldr]t^l^^ 
J-p Jp-p 

= 2TV [ [dr]V{r)i^{p-r) (3.43) 
J-P+p 

This leads to the non-linear integral equation 

P -i'ip) - —i'{p) + 2TVr V {r)4>{p - r)[dr] 
P vrp J~P+p 

+ p^{p){KE + SE + PE)=Xi){p). (3.44) 



We can write 5PE in a more convenient form, so that the self energy term —^'4){p) is 
cancelled. Let us work in the limit = oo so that P = oo. Since W{—r) = W*{r), we 
have 

5PE(p) = -2TV / ^^ \W{r)Mp - r) + Wi-r)tlj{p + r)l + 2 / Vi-r)iljip + r) [dr] 
Jo Jp 

(3.45) 

The second integral is finite, but with hindsight we add and subtract from it. We 
may rewrite the first integral using the definition of the "finite part" (Appendix^. Note 
that W{0) = 1. 

SPE = ^tM. _ 2 r ^^[i)(p - s)W(s) + i){p + s)Wi-s) - 2ij{p)] 
irp Jo 
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-2/ L^[^l;{p + s)W{-s)-i^{p)]. (3.46) 



p 



Thus 6SE is cancelled by the first term in 6PE. The integral equation becomes: 

-i>{p) - 2 J^^^i^P - s)W{s) + i>{p + s)W{-s) - 24^{p)] 

— [ip{p + s)W{-s) - i^{p)] + g P = sV'O 



oo 



P 

The quantities P, ii'E', SE, PE (defined in ^3.3(1 depend on the wave function and are 
to be determined self-consistently. We now study this integral equation, to find the 
ground-state of the baryon. Let us begin with the behavior of the wave function for 
small arguments. 

3.3.3 Small Momentum Behavior of Wave function 

To understand the behavior of the valence-quark wave function for small momenta, we 
do a Frobenius-type analysis of the integral equation for the minimization of the squared 
mass of the baryon: 

-lu ~ 1 ~ fP ^ ^ 

XMp) = P -Tpip) Tjj{p) + 2TV Vir^Tpip-rMdr] 

IP vrp J-P+p 

+pi){p){vKE + SE + PE) (3.48) 

For small p, we may ignore the last term on the RHS and the one on the LHS, since the 
other terms are more singular in the p — > limit. Dividing by P > we get the integral 
equation 

iu - -)-^(p) + 2TV r V{q)i;{p - q)[dq] = (3.49) 
vr p Jp-p 

where the self-consistent potential V{q) ~ ^ for small q. Now we assume a power law 
behavior 'ip{p) ~ for small p > and derive an equation for a. 



(.-i) = 2^p/' (3,50) 



where y = ^- Since p « P, we have 



y2 y2 
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h versus a 




0.02 0.04 0.06 0.08 0.1 



Figure 3.1: h{a). The solution for the exponent is the point at which the horizontal 

2 

line ^^^p- intersects the curve h{a). For m = 0, a = is a solution: in the limit of chiral 
symmetry, the wave function goes to a non-zero constant at the origin. 

The first of these integrals is singular and we evaluate it according to the definition of 

the finite part (Appendix 1^ . The result is a transcendental equation for a: 

u ^ - r (l + yr + (l-yr-2 ^ 2Fi(l -a,-a, 2 -Q,-l) 

TTu = h{a) = / K ay - IH (3.52) 

Jo I - a 

2 

Here 2F1 is the Hypergeometric function and v = It is easily seen that for m = 0, 
a = is a solution. In the limit of chiral symmetry, iIj{p) ~ p*^ as p ^ 0. Calculating 
h{a) shows that for a positive current quark mass m, the wave function rises like a power 

ip{p) ~ p", a > 0. The plot shows h{a). The solution for the exponent a{m) for m > is 

2 2 
the point at which the horizontal line ^^^p- intersects the curve. For sufficiently small 

2 

we may solve the transcendental equation for a, vr-Jj- = h{a) analytically. Expanding 
h{a) in a power series we get for small a, h{a) ^o^- Therefore, a — > 

Our conclusion that in the chiral limit the momentum space wave function tends to 
a non-zero constant as p ^ implies that it is discontinuous at p = since ip{p) = 
for p < 0. Therefore, the corresponding position space wave function decays like ^ as 
|x| 00. This may be expected from another point of view. The baryon we are studying, 
is a soliton of the meson field M{x,y). It should be possible to approximate the baryon 
wave function with the meson wave function for large spatial x. At large distances we 
are essentially far away from the baryon. The meson wave function has been calculated 
by 't Hooft ^53^ in the large- limit of 2d QCD. In the chiral limit, the 't Hooft meson 
wave function has a discontinuity at p = and consequently decays like ^ as |x| 00. 
This is a non-trivial consistency check. 
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3.3.4 Exact Solution for = oo,m = 0: Exponential Ansatz 

There is an exact solution to the integral equation for minimization of baryon squared 
mass in the chiral and large- limits: V'(p) ~ e^^,p > where b is fixed by the Lorentz 
reference frame. Moreover, we find that the mass of the baryon vanishes for this par- 
ticular rank-one configuration. Therefore, in the chiral and large-A^ limits, the exact 
minimum of (mass)^ occurs for a configuration with no anti-quarks. While this is to 

2 

be expected in the non-relativistic limit ^ ^ oo 68 , it is rather surprising in this 

2 

ultra-relativistic chiral limit, since conventional wisdom would suggest that as ^ > 0, 

the phase space for production of virtual qq pairs is enhanced. We find this is not true 
in the two-dimensional theory. 

To see this, let us derive the analytic solution. We will use a variational approach 
that also allows us to estimate corrections due to finite m and From the Frobenius 

analysis (section l3..S.3j) we know that the wave function must vanish like a power as 
p ^ 0^ with a ^ O"*" in the chiral limit. For N = oo the valence-quark wave function is 
not required to vanish for any finite value of p. But for finite N, it must vanish beyond 
p = P. If the former is to be a good approximation, it must fall off rapidly for large 
momenta. 

As the simplest ansatz that satisfies these boundary conditions and also facilitates 
explicit calculation, we take the variational wave function '4>{p) = p°'e~^^. The Lorentz 
invariant parameter a is determined by minimizing the (mass)'^. In the null coordinates, 
a rescaling p — > Ap is actually a Lorentz boost in the longitudinal direction. Therefore, 
b is determined by the choice of the reference frame. For the purposes of determining a, 
let us work in a reference frame in which 6 = 1. We will fix b subsequently, based on the 
momentum sum rule, which serves to pick a reference frame. 

In the notation we introduced earlier, = P* {vKE + SE + PE). The normalized 
wave function for the exponential ansatz is ^a(p) = \J 2-'^-'^°T{i+2a) 'P'^^~^ ' -^"-"^ non-zero 
current quark mass, KE diverges for negative a. Therefore, it is physically reasonable 
to consider only positive values of a. In position space. 
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Then 



P(a) = i±^^ KE(a) = -, SE(a) = - — . (3.54) 
2 a ira 



The potential energy PE{a) = \ipa{x)\'^Va{x)dx is determined by solving V^' (x) 
\i{ja{x)\'^ subject to bounda 
V^{0) = 0. Integrating once, 



|V'a(2;)P subject to boundary conditions T4(0) = x\'ip{x)\dx = -^-j=^jY^^ and 



,/M^ -^"+°'^'T'7'i-^> . (3.55) 

0Fr(2 + a) 



And the second integration gives 



, , r(«)[(l + x^)-° + 2ax2 2Fi(l, 1 + Q, j, -x^)] 



We arrive at 



2l-2a^2 

"^^^"^ ~ r(i + 2a)r(l - a)(r(i + a))3(- sin (na) + sin (3to)) ' ^^'^^^ 

The Frobenius analysis suggests that the (mass)'^ is minimized if a = in the chiral 
limit. Therefore, we expand PE{a) in a Laurent series around a = 0: 

1 vra 12C(3)a2 ^ „^ ^ ^ 

PE{a) = — + - ^^^ + 0{a^). 3.58 

vra 3 vr 

The pole at o = in the potential energy is cancelled by a similar pole in the self energy. 

Alternatively, we can calculate PE{a) in momentum space using the definition of the 
"finite part" integrals (see ^3.3.1(1 . We use this as a check on our definition of J-V. Let 



Then 



^.(ri=P°e-^ ^^,_,,_4;,^,^.,, . (3.59) 



PE(a) = -2 I H(fil__L[<i,], (3,60) 



where 

Wa{s) = / Va(s + t)ra{t)[dt\ for s > 0, VF(-s) = W*{s). (3.61) 



We get 



(2s)2+°'^Ki . (s) CSC vra 

^^°'->- r(-„)r(r+2„) P-«^> 
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where Kn{z) is the modified Bessel function of the second kind. This results in the same 
expression for PE{a) as before. Cohecting the results, the baryon (mass)^ is: 

52iV2 ^ 2 '^a r(i + 2a)r(l-a)(r(i+a))3(-sin(7ra)+sin(37ra))^ ■ ' 

For small 'a', we may write 

g'^N'^ 2 a 6 vr 

The Chiral Limit 

In the chiral limit m = ^ = 0, 

is the product of two factors, both of which are monotonically increasing functions for 
small a. Therefore, the baryon mass is minimized for a = 0, with a minimum value of 
0. Since the (mass)'^ is a Lorentz scalar, it is unchanged if we Lorentz boost this wave 
function. Thus, the {mass)"^ must vanish for ip{p) = Ae^~^P^ for any 6 > 0. 

Assuming the positivity of (mass)^ on physical grounds, we conclude that in the 
large-A^ and chiral limits, the absolute minimum of (mass)^ occurs for a configuration 
containing only valence-quarks. We emphasize that the exponential solution is not just 
the absolute minimum of the baryon mass on the space of separable configurations but 
on the entire phase space of Hadrondynamics, in the chiral and large- limit. 

The masslessness of this two-dimensional baryon may be puzzling at first. But there 
is a precedent for this, 't Hooft found that in the chiral limit, the lowest lying meson 
was massless in (the large-A^ limit of) 2d QCD 53 . Moreover, numerical calculations 
of Hornbostel et. al. [HHI also show that the baryons and mesons of 2-dimensional QCD 
are massless in the chiral limit. 

Momentum Sum Rule and Determination of '6' 

The valence-quark wave function in the chiral and large- limits is ipip) = Ae~^^ , p > 0. 
Normalization gives A = \/47rb. A Lorentz boost is just a rescaling of momenta, so the 
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momentum sum rule 



oo 







P\i^{p)\ [dp] = P/N = P (3.66) 
fixes the reference frame and determines b = \/2P. Therefore, 

~ /27r ,~P\ 

This is the exact valence-quark wave function for N = oo,m = 0. Thus the exact form 
factor of the baryon in the large- and chiral limits is 

M(p,g) = -(^)e-(^+^)/^^ (3.68) 

For finite N, we work in a reference frame in which the total baryon momentum is P = 1. 
In the case = 3, this corresponds to P = P/N = |. So to compare this wave function 
with that for finite A^, P = 1/3 is the appropriate choice. 

3.3.5 Effect of Finite Current Quark Mass 

2 

We estimate the corrections due to non-zero u = variationally using the above 
exponential ansatz. From the Frobenius analysis, we already expect that a — > 0"*" as 
— > 0^. To investigate whether there is a minimum in a, we expand ^2^"$ in a 
Laurent series: 

M'^(a) u a-K 6C(3) ^, o,, ,^ 

g'^N'^ 2a 6 3 vr 

We see that for small a, the O(^) and 0{a) terms dominate and have opposing tendencies. 

Thus there is a positive value of a for which is minimized as fig. 13.21 shows. For 



any given we can solve the transcendental equation for the minimization of Ai" 



J2 

numerically and find the optimal a. We show this in fig. 13.31 We see that a ^ 0"*" 
as -|t- 0"*", recovering the exact exponential solution in the chiral limit. However, 

2 

for small enough current quark mass, we can get the asymptotic power law a{-^) 

3 m 



analytically. Minimizing in a, we find that a ^ v ^ found from the Frobenius 
analysis (section EH- Moreover, ^ + j^^jf^^- 

In other words, for sufficiently small current quark masses, our variational estimate 
for the valence-quark wave function is 4^{p) = Ap"'e"^P, with a = ^^J^. The variational 
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Figure 3.2: Estimate of 4 baryon mass'^ in units of g'^N'^ as a function of a. The curves 



top to bottom are for 



.05, .02, .005, .001. As 4^ 



0"*" , both the minimum value of 



(mass)'^ and the value of a that minimizes it tend to 0"^, recovering the exact massless 
exponential solution in the chiral limit. 




Figure 3.3: The power a in the wave function ip{p) = A]fe plotted as a function of 
1000#. 
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upper bound for the baryon mass is A4 = gN{^)'i (y) 2 . Though the baryon mass is of 
order N, in the chiral hmit, the proportionaUty constant vanishes in two dimensions! 

3.3.6 Variational ground-state for Finite N 

We have seen f section l3.'2.3() that the leading effect of finite is to restrict the range of 
quark momenta to < p < P. Thus V'(p) vanishes for p > P. For finite, but large-A^ we 
expect the ground-state wave function to be weh approximated by the exact exponential 
solution H3.67() for N = 00, in the chiral limit. Since 

(1 - - (3.70) 



the wave function 



^P{p)=A{l-^)\0<p<P (3.71) 



should be a good ansatz for the ground-state. From the above argument, we expect b to 
be proportional to N. More generally, the Frobenius analysis (|3.3.3j) shows that 

i^ip) = Ap'^il - |)^ 0<p<P (3.72) 

should be a good ansatz for the ground-state even for non-zero current quark mass. A is 
fixed by normalization; a is determined by the minimization of baryon (mass)'^ . From 
the Frobenius analysis we know that for small a ^ \/ . b can be eliminated 
using the momentum sum rule: 

rP 
10 

Thus 



N I p\i'{p)f[dp] = P (3.73) 



b=--l + a{N-l) - l + J-^(N-l) (3.74) 

2 2 \ IT g 

which is proportional to N as we expected. The valence-quark probability density is 
In the chiral limit, 

y(^) = (Ar-i)(i-^)^-2 (3.76) 
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which agrees with to the distribution obtained from numerical discretized Ught cone 
calculations by Hornbostel et. al. in [58] . 

3.4 Beyond the Separable Ansatz 

3.4.1 Fixed Rank Submanifolds of Grassmannian 

We have been studying a reduced dynamical problem on the separable submanifold of 
the phase space. By the variational principle, the minimum of energy on a subset of the 
phase space provides an upper bound for the true minimum. To improve on this estimate, 
we must allow for a larger set of configurations. We will now give an increasing family 
of submanifolds of the phase space, whose union is dense in the grassmannian . Each of 
these is the phase space for a reduced dynamical system. They turn out to correspond 
to interacting quark models, that go beyond the valence-quark approximation. 

Let us generalize the rank-one ansatz to a rank r ansatz (a, b here do not stand for 
color, the meaning should be clear from the context): 

r 

Mr= J2 ^b^- ® V'^'- (3.77) 

a,b=l 

We must pick and the ipa such that the configuration Mr is admissible: trM^ < oo 
and satisfies the Pauli exclusion principle: Mj. = Mr, {Mr + e)"^ = 1. The admissibility 
condition is automatic since Mr is a finite-rank operator. We pick the tpa to be a finite 
number r of orthonormal eigenstates of e, the sign of momentum. 

ei'a = ealpa, = ±1- (3.78) 

Suppose there are r_|_ positive momentum vectors and r_ = r — r+ negative momentum 
vectors. Let e = Diag(ea) be the diagonal r x r matrix representing the restriction of e 
to the sub-space of 7i- ®7ij^ spanned by the tpa- This ansatz satisfies the constraints if 
the r X r matrix ^ is hermitian and satisfies the constraint 

{e + if = l, (3.79) 

a mini- version of the constraint on Mr- If we let </> = e -|- 1^, then the constraints on (f) are 

= ^'^ = 1 (3.80) 
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Thus 4> (and hence ^) hes on a finite dimensional grassmannian . The connected compo- 
nent of this finite dimensional grassmannian is fixed by Baryon number: B = —^trMr = 

Specializing to the case of baryon number one, it is easily seen that the simplest 
solution to the constraints on ^ is r_|_ = l,r_ =0, so that ^ is just a 1 x 1 matrix, the 
number -2. Then Mi = — 2i/^ tf:'^ . This is just the rank-one separable ansatz that gave 
rise to the valence-quark model. 

To go beyond the valence-quark approximation, we need to find other non-trivial 
solutions r-|.,r_,^ which satisfy these constraints. If r = + r_ = 2, the only non- 
trivial solutions to the constraints reduces to the rank-one solution. To see this we first 
note that for r = 2, is a point on the finite dimensional grassmannian of C^, the set 
of subspaces of C^. This grassmannian has three connected components, corresponding 
to tr = —2, 0, 2. The baryon number constraint implies that tr(j) = r+ — r_ — 2. For (j) 

to lie on the finite dimensional grassmannian , we must have r+ = 1,2. The case r+ = 1 

, /-I \ /I \ . (-2 0\ 

gives (f) = \ , and e = , so that ^ = <p — (- = \ , which is the 

V -i) Vo -1/ V 0/ 

same as the rank-one solution. 

The case r+ = 2, r„ = may also be reduced to the rank-one ansatz. In this case 
tr (;/) = 0,e = 1. Therefore, <f) \s a point on CP\ which may be parameterized as 
(p = 1 — 2zz^ . Here z e C^, ||z|| = 1, though ip is independent of the overall phase of 
z. Then ^ = — e = —Izz'^ and M2 = —2{zz^)'^ipa ^ "ip^^ ■ However, we notice that M2 
has a gauge symmetry, under the simultaneous action of U{2) on both z and V- This 
symmetry may be used to bring z to the standard form . It is then clear that M2 
reduces to the rank-one ansatz. 

Thus, r = 2 does not provide any new solutions. The physical reason for this is 
clear. Roughly speaking, r = 2 corresponds to a baryon that contains only valence and 
sea quarks (r^ = 2) or a baryon that contains only valence and anti-quarks (r^ = 1). 
However, we know that a consistent description of the baryon is possible only if we have 
all three components: valence, sea and anti-quarks. This suggests that we should try a 
rank three configuration. Indeed, a new non-trivial rank three solution exists; we discuss 
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it in the next section. 



3.4.2 Rank Three Ansatz 

The next simplest possibihty is of rank three, with r-( 
/-I 0\ 



2,r_ 



1 so that e 



10 



Let us denote the negative momentum vector ■(/'_ and the two posi- 



V 1/ 

tive momentum vectors It will turn out that the former represents anti-quarks 

while the latter represent valence and "sea" quarks. The constraints on cp = e + ^ are 

<^t = ^ 1^ = _i (3.81) 

Each solution to these constraints determines a one dimensional subspace of C^, the — 1 
eigenspace of 4>; i.e., a point in CP^. We may parameterize CP'^ by unit vectors in 
C^, whose overall phase is irrelevant: 



-l+2c®c^ 



1, C~e*% 



(3.82) 



Letting ( 



Co 



we get 



/2IC-P 
2CoC 

V2C+C 



2C-a 

-2 + 2|Co 

2c+a 



2C-q 
2Coa 

-2 + 2|C+|V 



(3.83) 



Then M3 = ^^V'a ® V'''^ h^-s a C/(l) x SU{2) a global gauge symmetry. The phase of the 
negative momentum vector ip- may be changed and the two positive momentum vectors 
and V'+ may be rotated into each other, provided we transform (" in the same way. 
Let us use the three degrees of freedom in the SU{2) symmetry to rotate the vector 

fC-\ /o\ 

^ J to the standard form j, with C+ real and positive. Since ||C|| = 1, C+ = 
a/1 — IC-P- Then M3 has the form 

Ms = -2V'®V"^ + 2|C-P[^-®V--V'+®^+] + 2[C-C+V'-®V'+ + C+C-V'+«'V'-]- (3.84) 
We see that M3 is still invariant under 



C- 



C- 



(3.85) 
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In the case of the separable ansatz, the phase space of rank-one configurations was 
V(J~L+). The analogous space of rank three configurations is more involved. V and 
determine a point in the flag J-l2^i{T~i+)- ip- determines a point in the projective space 
P{7i^). The total space of rank three conflgurations is then a bundle over Tl2^i{7i+) x 
P{7i^), with fibre the unit interval [0, 1], parameterized by the absolute value of 

We may use the C- ~^ e*'-"+ freedom to make C- real. Thus, the rank three 
configurations which satisfy the constraints are given by 

Ms = -27P V"^ + 2C [V'- V'- - ^ + (S)ipi + iJ+^ V-]- (3.86) 

Baryon number may be expressed as 

B = --trMs = {mp)? + e- P+{p)? - \^-{-p)\''\}[dp]. (3.87) 

The total momentum P = —i^N J pM{p,p)[dp] becomes 

_ p POO 

P = ^r= pmp)\''\ + eA\i>~{-p)? + \Mp)?]}[dp\ (3.88) 



N 

The variational quantities V'-) "0+1 C- a-rs determined by minimizing the energy of 
the baryon. The kinetic energy is simple in momentum space 

foo 1 ,,2 

K= / -[p+^]{\m\^\ + (,-U-{-p)\^ + \^+{p)?]}[dp]. (3.89) 
Jo 2 p 

The expression for potential energy is simpler in position space, but is lengthy since it 
is quadratic in AI{x,y). Unlike in the rank-one case, our wave functions also depend on 
a discrete index. A discrete analog of the Fourier transform, which diagonalizes ^ makes 
the potential energy simpler. Let, 

V'l = ^{ V[i - C-IV'- - 41 + C-IV'+I (3.90) 

^2 = ^( V[l + C-]^- + 41 - C-]^+| (3.91) 



In this basis 



4 

f^C- 
0-2 



is diagonal. However, the new wave functions V'l 2 



V -2C-J 
are no longer eigenstates of e. M3 becomes ('i/'o :='*/')• 



- ^Ms = V'o ® V-o + C-{V'i ® V'l - ^^2 ® V'l} (3.92) 
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And the potential energy is given by 

U = ^y'||V'o(x)|Voo(x)+eiV'l(x)l'^ll(x) + e|V2(x)|V22(x)|dx 

+^2 Re J S^C-M^Wi{x)Vio{x)- 

C_MxW2{x)V2oix) - C!V'i(x)V'2*(x)^2i(x)|dx. (3.93) 
The mean-fields V^^ are defined as 

Vapix) = \j dyMy)\x - y\r^{y) (3.94) 

and satisfy the hermiticity condition V^^(x) = V^aix)- They are the solutions to the 
differential equations 

Vafiix) = %l)c,{x)i)*p{x). (3.95) 

with the boundary conditions 

K/3(0) = \j dyi^o.{y)\y\rfi{y) (3.96) 

and 

^Lpix) ^6^13, for |x| oo. (3.97) 

Thus we have defined the phase space and hamiltonian of the rank three ansatz. 
As in the rank-one case, (section ESS), we can work out the Poisson bracket relations 
among ip, ijj± implied by the Poisson brackets of the M{p,q). We can then quantize 
this classical dynamical system to obtain an interacting quantum system of quarks and 
anti-quarks, whose color is not being counted explicitly. These particles would be the 
rank three analogs of the bosons we encountered in the rank-one ansatz. However, we 
will follow a different strategy this time, which clarifies the connection of the rank three 
ansatz to the quark model. 

3.4.3 Fock Space Description 

In what follows, we will give two Fock space descriptions of the rank three configurations. 
In the first case, we will generalize the rank-one (A^ = oo) formula \Mi >= 6^|0 > to the 
case M = M3. This will give us a description of the baryon in terms of quasi-particles 
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which are certain colorless combinations of the true quarks and anti-quarks. Next we 
will generalize the (finite N) formula |y >= ■ ■ ■ 9^^|0 > in order to get a description 
in terms of colored particles. 

In order to find the Fock space state corresponding to M3, we use an analog of the 
Pliicker embedding j24[ I79j of the grassmannian in the Fock space, just as we did in the 
rank-one case in ^3.2.41 The filled states x ^-re —1 eigenstates of <^3 



^3 X = (e + Cb^ai^'^) X 



X 



(3.98) 



There are two types of solutions to this eigenvalue problem. First let us look in the 
orthogonal complement of the subspace spanned by {tpa}a=i- 



ex 



-X and < X >= 0, 6 = 1, • • • r. 



(3.99) 



This means x is a negative momentum state, orthogonal to ■0,V'±- Thus, all negative 
momentum states, orthogonal to ip- are filled. 

For the other type of solution, we consider those x that lie in the span of the V'a- Let 

X = Ea=l Xai^a- 

-Xa- 

/-1 + 2IC-I' 



eaXa + iaXb 

Thus, we are looking for —1 eigenvectors of e -|- ^ 











-1 



(3.100) 

2C-C+ \ 




1 

Vo/ 



and x^^^ 



V 2C+C- 1-2|C-|V 
are the solutions. Thus, the states 





X^^) = V and x^^^ = -C+V'- + C-t 



(3.101) 



are also filled. Thus the Fock space state corresponding to the rank three ansatz M3 is 
a semi- infinite wedge product: 

• • • /2' A /I' A V A (-C+V'- + C-V'+) (3.102) 

Here is a dual basis in the orthogonal complement of "0- in Ti- . As before, we focus 
on the deviation from the vacuum: 



|0 >= 



(3.103) 
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Then IM3 > is obtained by populating ^jJ and (— C+V'- + C-V'+) and depopulating -0-, 
starting from the vacuum |0 >. 

IM3 >= (-C+6;_ + C-bl^)b^.bl\6 > (3.104) 

However, the operators blp_^,blp,b^j_,b^ are not the creation-annihilation operators of 
quarks. They create and annihilate the quasi-particles of the rank three ansatz. This 
point of view is simpler for computations, and avoids the unobservable color quantum 
number. 

However, we are more accustomed to thinking in terms of the Fock space of quarks 
and anti-quarks, which carry the color quantum number. We now determine the state 
consisting of "valence", "sea" and anti-quarks, |yS'j4 > that corresponds to the rank 
three ansatz. We will see that the advantage of this point of view is that it has a dual 
description in terms of an embellished quark model, which we discuss in the next section. 

Recall that the Fock space state \ V > corresponding to the rank-one ansatz is 

\V>=q'^---q^^\0> (3.105) 

where, q'^ creates a quark with color a in the state ip. These operators satisfy canonical 
anti-commutation relations. As before, the state jyS'A > in the Hartree approximation 
is determined by the condition that the form factor of the quark density matrix n the 
solitonic state jyS'A > should be the classical density matrix in the large- limit 

< VSA\p{p,q)\VSA >= ps{p,q). (3.106) 

Here p{p,q) = ^q^Hp) qa{q) and hiP^O) = i(2vr5(P - q) - M3{p,q) - e{p,q)). While 
it was easy to guess the form of \V >, we will use an indirect method to find jyS'A >. 
We know that \V > satisfies the condition < V\p{p, q)\V >= pi{p,q)- We proceed by 
finding the operator that transforms pi into p^. Then we represent this operator on 
the Fock space and use it to transform \V > into |yS'j4 >. This is feasible since the 
grassmannian carries a transitive action of the infinite dimensional restricted unitary 
group (section I2.2|l . We may obtain the rank three configuration = M3 + e from the 
separable ansatz <I>i = Mi + e by a unitary transformation: 

C/t(e -2iP^ tP^)U = e + 4"Va ® V'"^^ (3.107) 
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(We work with $ = Af + e or p since they transform homogeneously under unitary 
transformations, while due to normal ordering, M does not.) The infinite dimensional 
restricted unitary group is disconnected, unlike its finite dimensional counterparts. How- 
ever, since both Mi and M3 are configurations of baryon number one, U lies in the 
connected component of the identity, and is of the form U = e*"^, for hermitian A. From 
the expressions for AIi and M3, 

Ml = -2 ■0 ® 

M3 = -2 ^p (g) ip^ + 2C! [V^_ V- - V'H- ® V'l] 

+2 C-C+ii^- 4^1 + i^U- (3-108) 

We see that U must be the identity except on the span of ip- and '(/'+• The restriction of 

/-I 0\ 

$1 to this subspace is —a^ = , while the restriction of <I>3 is so"i + (r — 1)cj3, 

V ij 

where r = 2('^ and s = 2C_(^_|_. Here cjj are the Pauli matrices. Thus, the restriction of 
A to this subspace satisfies 

e-'^{-a3)e'^ = sai + (r - 1)cj3. (3.109) 

Therefore A is a 2 x 2 traceless hermitian matrix cj.w. Here w is the vector in R'^ about 
which (0, 0, —1) must be rotated by an angle 2|w| to reach (s, 0, r — 1)). We get 

A{p,q) =iaj:csm{C-){i^-{pW+{q) -Tp+{pW-iq)) and U{p,q) = e'^^'''\ (3.110) 

Let us denote 6 = arcsin(C_). Therefore we have {p = ^(1 — ^>)), 

-^^(P'^)pi(r,s)e*^(^'^)H[ds] = ps{p,q). (3.111) 



Now we look for a representation of U{p, q) as a unitary operator U acting on the quark 
creation annihilation operators. Moreover, suppose we can show that 



UU{q)U = J[ds]q{s)UHs,q) 
U^\q)U = f[dr]U{p,r)q\r). (3.112) 



Then 

\VSA >=U \V > . (3.113) 
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since for this choice, the expectation value condition is satisfied: 

< VSA\^q^{p) q{q)\VSA > = < V\UU^ {p)UU^{q)U\V > 

"[drjlds] < V\U{p,r)q'^{r)q{s)U\s,q)\V > 
[dr][ds]tj{p, r)pi(r, s)U'<{s, q) 
= P3{p,q)- (3.114) 

It only remains to find the second quantized representative U and show that the quark 
creation and annihilation operators transform in the desired manner under it. U may 
be obtained by promoting the wave functions that appear in U = e*^^^''^^ to operators: 

We sum over colors to produce a color-singlet. We can check that q{p), q^{q) have the 
desired transformation properties under U. For example, we show that U'^q'^{p)U = 
J[dr]U{p,r)q'f {r). For this we will need the preliminary result 

[A, q^p)] = i^+{p)ql_ - i^-ip^l^ (3.116) 

which is easily verified. Then 

U^\p)U = e-'^q\p)e'^ 

= {l-iA + 0{ef) q\p) (l + iA + 0{ef) 
= q\p)-i[A, q\p)]+0{ef 

= q\p) - i{^+{p) j [dr]'iij*_{r)q\r) - ^^{p) J [dr]iP\{r)q^r)) + 0(9)^ 
= qHp) - i I [dr]q\r){ij+{p)i)*^{r) - i^_{p)i;l{r)) + 0{ef 
[dr][2T^5{p,q)+iA{p,q) + 0{ef]q\r) 

U^q^{j))JJ = I [dr]U{p,r)q\r) (3.117) 



as desired. We conclude that 



\VSA >= e~^^\V >= > _ (3_-^^g) 
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jl/SA > is the second quantized Fock space state corresponding to the rank three ansatz 
M3. Here the g^", qa create and annihilate colored quarks. We see that > may be 

obtained from the valence-quark state by a rotation. In the next section we will show 
that the same result could have been obtained by a Bogoliubov transformation, applied 
to the valence-quark model. 

3.4.4 Valence, Sea and Antiquarks: Bogoliubov Transformation 

The interacting valence-quark model we derived earlier (section 13.2.2)1 ignored the anti 
and sea-quark degrees of freedom. Now, we show how they can be included by perform- 
ing a unitary transformation on the Fermionic Fock space. This is like a Bogoliubov 
transformation that mixes positive and negative momentum states. 

In general, the baryon can be in a superposition of states containing N+r] quarks and 
•q anti-quarks for ry = 0, 1, • • • , cxd. This is because only the difference, baryon number, 
is a conserved quantum number in the full theory. The valence-quark approximation 
amounts to ignoring all states for 77 > and in the Hartree approximation, corresponds 
to the state |y >= • • • q^Mo >. 

Let us continue to work within a factorized Hartree approximation, ignoring corre- 
lations except when required by the Pauli exclusion principle or color-invar iance. The 
next simplest possibility beyond the valence approximation is to consider only states with 
r] < N. In the ground-state of the baryon, we find that this is a good approximation. 

The quarks cannot all occupy the positive momentum state ip{p) as in the valence 
approximation, because they would violate the Pauli exclusion principle: there is no 
completely anti-symmetric tensor in more than indices transforming under SU{N). 
In the ground-state, it is energetically favorable to introduce just one more positive 
momentum state ip+{p) to accommodate the quarks. As for the anti-quarks, they occupy 
the lowest available negative momentum state V'-(p)- 

To obtain a state containing valence, "sea" and anti-quarks (say ly/Svl >), we apply 
a unitary transformation to the valence-quark state >• It will be determined by the 
variational wave functions and ip- in additional to an "angle of rotation" 9. These 

are to be determined by minimizing the {mass)"^ of the baryon. In order to produce a 
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state |yS'j4 > of norm 1, we require the transformation to be unitary. In analogy with 
a BogoHubov transformation, the operator that creates a quark in state tpj^ and an anti- 
quark in state ^/^_ is aij^^a^^. We have summed over colors to produce a color-singlet 
state. Our variational ansatz for the state jyS'A > is then e ^ ' ' \ V >. 

We see that this is identical to the state jyS'A > we obtained from the rank three 
ansatz, provided we identify with sva.0 where 9 is the angle of the Bogoliubov rotation. 

This point of view confirms the interpretation of tp- as the wave function describing 
anti-quarks, tp and "0+ rnay be interpreted as "valence" and "sea" quark wave functions. 
Their orthogonality can be understood as a consequence of the Pauli exclusion princi- 
ple. < C_ < 1 measures the departure from the valence-quark approximation. The 
probability of finding an anti-quark in the baryon is • 

While the physical meaning of these quantities are best understood in the context 
of the quark model, actual computations are most easily performed in the context of 
Hadrondynamics. We turn next to an estimation of the anti-quark content of the two- 
dimensional baryon. 

3.4.5 Variational Estimate of Antiquark Content 

In ^3.3.41 we found that the absolute minimum of energy (baryon mass) in the limit of 
chiral symmetry occurs for a rank-one configuration, in large- limit. In this limit, 
the anti-quark content of the baryon vanishes. We find that for small values of current 
quark mass << 1), the baryon has a small but non- vanishing anti-quark content. 
We already saw that the leading effect of finite N was to make the wave functions ipa{p) 
vanish beyond the total baryon momentum P. 

To determine the rank three configuration, we can derive integral equations for the 
minimization of Ai. However, from our exact solution, we expect that the anti-quark 

2 

content is small for small current quark masses. To estimate (^_(-|^), we do a varia- 
tional minimization of M, assuming some simple functional forms for ip±,'<p, which are 
suggested by the form of the exact solution. Moreover, we work in the limit N = oo. As 
before, we pick 

i'(j^) = C{lre^, p>0 (3.119) 
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Here P = ^ is the mean baryon momentum per color, C is fixed by normalization and 
b is fixed by the choice of Lorentz reference frame, il^^ must be orthogonal to ijj, so it 
must have a node: 

^P+{p) = C+ilnl - C7i)e^, p > (3.120) 

Here Ci is fixed by orthogonality. In the ground-state the anti-quark wave function has 
no nodes (besides at p = 0), and for ease of computation we pick it to be the same 
function (but for p < 0) as the valence-quark wave function. (Note: this is not required 
by any symmetry.) 

'ip-ip) = i^{-p), p<0 (3.121) 

Since a Lorentz boost in the longitudinal direction corresponds to a rescaling p ^ Xp m 
null coordinates, we pick a reference frame in which 6 = 1, and minimize the Lorentz 
invariant in that frame. This determines the Lorentz invariant quantities C-i'^) 
and a(#). 

The (massf- of the baryon is = P{2E - P) 



1 /■ 1 
+- / dx(iy|M(x,y)p-|x - 2/ 



2p TT J 2p 

(3.122) 



where (z^ = We abbreviate 

P *{vKE + SE + PE). (3.123) 



9" 

Using the rank three ansatz, 

M3 = -2i; 2C! ^1 - V+ ® + 2C- ^Jl-C[■^p- V+ + V'+ ® V"- ] (3-124) 

for the above choices of wave functions we find: 

P(C-,a) = l + 2C'+a(l + 2e), KE{C-,a) = 1±^, SE{C-,a) = -i±^ (3.125) 
2 a na 



PE{C_,a) = {Uo{a) + C^^l-eUl{a) + C^U2{a) + C^^Jl-eU3{a) + CU4{a)) {3.12Q) 

The potential energies Ui{a) can be calculated analytically for the most part, and we 
use the symbolic package Mathematica for this purpose. We must minimize Ai'^ in the 
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strip < C~ !^ IjO^ ^ 0. We find that for small but non- vanishing M."^ has a 
minimum for small but non-zero values of both and a, indicating a small anti-quark 
content. To see that such a non-trivial minimum exists, it suffices to consider the factor 
[vKE + SE + PE), since P is monotonic in C_ and a. Moreover, since C,- turns out to 
be small, it is enough to consider only terms in PE{(^-, a) which dominate for small ^_ 

Pi?(C- ^ 0,a) ^ (Uoia) + C- V^l - eUiia) + eU2{a)) (3.127) 

Uo{a) is the same as in the valence-quark approximation and was obtained in ^3.3.41 For 
small a, 

Uo{a) = — + ^ + 0{a^) (3.128) 
vra 3 

We also find that Ui{a) is negative and goes to zero as a — > 0. As for U2{a), we find, 

2 

U2{a) h constant. a + 0{a^) (3.129) 

vra 

Thus, the terms proportional to ^ in PE and SE cancel out exactly, and we get 

/-2 ^ 

{uKE + SE + PE){C-,a) =dia + u + daaC- + 2z^-^ - - C- (3.130) 

for positive constants di, i = l,---,4. Isolating the a and C_ dependence below we 
see that for > 0, {vKE + SE + PE){(^-,a), and hence Ai^ must have a non-trivial 
minimum for some > 0, a > 0: 

(z.i^i? + 5i? + Pi?)(C-) = /o-M-V^wF + M-, 

{vKE + SE + PE){a) = — - hoa^^ + hia. (3.131) 

a 

Here the fi{a),hj{(^) are positive. Thus we estimate that the minimum of baryon 
{massY' on the rank three sub-manifold occurs for a small but non-trivial anti-quark 
content, when the parameter -J^- is small. However, as the plots (fig. 13.51 &: show, 
in the limit of chiral symmetry, ^ 0, a — > 0, recovering the exact rank-one solution. 

To include the leading corrections the wave functions must vanish beyond p = P. 
Setting = p/P, we pick 

m = cc{i - 0', i^+io = c+ei^ - ci)(i - 0' for 1 > e > 
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Figure 3.4: measuring departure from valence-quark approximation plotted as a 



percent as a function of 10004^. The probability of finding an anti-quark in the baryon 
is i^2(^'^ • '^^^ valence-quark approximation becomes exact in the chiral limit. 
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Figure 3.5: The power governing behavior of the wave functions near the origin, 
plotted 3jS Si function of 
p — > in the chiral limit. 



plotted as a function of lOOOz^ = 1000 -p-. The wave functions tend to a constant as 
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Figure 3.6: Variational upper bound for ^ mass'^ of baryon in the rank three approxi- 

2 

mation plotted as a function of 1000-^. The baryon is massless in the chiral limit. 
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Figure 3.7: Variational approximation for valence-quark density V{^) = 27r|'i/;(^)p in 
the rank three ansatz. It is normalized to 1. 
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Figure 3.8: Variational approximation for sea quark distribution 27r(^^ | ^+ (^) p in the 
rank three ansatz. It is normalized to C^. 




Figure 3.9: Variational approximation for antiquark quark distribution 2TT(^^\'tp{^)\'^ in 
the rank three ansatz. It is normalized to 
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Figure 3.10: Fraction of baryon momentum carried by anti-quarks as a function of 




and = for -1<^<0 (3.132) 

The behavior of the wave functions for p — > 0, is insensitive to the boundary condition 
for large p, so we may use the value of a determined in the large- limit. Similarly, we 
ignore the correction to the small parameter (See Fig 13.51 fc 13. 4j) . b is determined 
in terms of a and (- by the momentum sum rule: 

^ = J%mp)\'\ + eu-{-p)\' + \Mp)mdp]. (3.133) 

Ignoring the small difference between the momentum carried by sea and anti-quarks 
(both of which are suppressed by factors of (^^ compared to the valence-quark contribu- 
tion) we find 

6 ~ -1 + + C-) + a{-l + N + 2NCi) (3.134) 

We plot the valence, sea and anti-quark densities as functions oi ^ = -p for = 10^'^, 
= 3. For these parameters we estimate C- = 4 x 10"^, a = 3.5 x 10^^ and b = 0.57. 



3.5 Application to Deep Inelastic Scattering 

The structure of nucleons (proton and neutron) is measured in Deep Inelastic Scatter- 
ing (DIS) experiments. Electrons (or muons, neutrinos) are scattered off nucleons by 
the exchange of a space-like {q^ < 0) virtual photon (or weak gauge boson). Thus, 
the electro-weak force is used as a probe of the strong force binding hadrons, just as 
Rutherford used alpha particles to study atomic structure. The DIS experiments of 
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the late 1960s and early 1970s jSH] discovered scaling. The structure of the proton was 
roughly independent of the wave length of the virtual photon used as a probe: the mea- 
sured structure functions were independent of q"^. The quark-parton model of Feynman 
|17l I36j explained this phenomenon in terms of point like quarks, which had to be weakly 
interacting at high energies. Soon after, it was discovered that the only renormalizable 
four dimensional quantum field theory of quarks with this property of asymptotic free- 
dom at high momentum transfers is a non-abelian gauge theory of quarks coupled to 
gluons, which we now refer to as QCD. Perturbative QCD predicts small logarithmic 
violations of scaling. The evolution of structure functions as Q^{= —q^) is increased is 
given by the DGLAP equations This dependence has been experimentally veri- 
fied. However, the measured structure functions also contain information on how quarks 
and gluons bind to form a relativistic bound state. This is encoded in the momentum 
fraction ^ = p/P or Bjorken variable xb = dependence of structure functions, or 
parton distribution functions. Like atomic wave functions, they tell us the probability 
of finding a quark with a given momentum p in the proton whose momentum is P. It 
has not been possible to describe the formation of hadronic bound states by perturbing 
around the limit of non-interacting quarks. What is more, unlike the electrons in an 
atom, quarks appear to be permanently confined inside hadrons, and have never been 
isolated. Understanding the bound state structure of hadrons and the phenomenon of 
confinement remains a challenge in 3 -|- 1 dimensions. 

However, in Deep Inelastic Scattering, the transverse momenta of partons are small 
compared to their longitudinal momenta. Moreover, a perturbative treatment of tans- 
verse momenta with an upper cut off Q leads to the same leading dependence for 
structure functions as predicted by the operator product expansion in perturbative QCD. 
We have determined the bound state structure of quarks in a baryon in 1 + 1 dimensions 
in preceding sections. This leads us to propose a model for the xb dependence of struc- 
ture functions at an initial low value of = Qq where transverse momenta are ignored. 
Our quark distribution functions are to be used as initial conditions for evolution 
and compared with experimental data at higher values of Q^. 

It is an experimental fact that at low Q^, about half the momentum of the proton is 
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carried by gluons |21j . However, two-dimensional QCD does not contain any dynamical 
gluon degrees of freedom, which are necessary to find the gluon distribution. Rather 
than two-dimensional QCD, it is more appropriate to study the dimensional reduction 
of QCD from four to two dimensions, which includes the transverse polarization states 
of the gluon field. But understanding the non-perturbative effects of the gluon degrees 
of freedom is substantially harder. Indeed, the second part of this thesis is motivated by 
a desire to do just that. For the present, we account for the omission of gluons by merely 
assuming that the quarks carry only a fraction / of the baryon momentum. Moreover, 
we found that anti-quarks and sea quarks carry a very small portion of the baryon 
momentum in two dimensions (see ^3.4.5() . Phenomenological fits [211 176j to experiment 
show that anti and sea quarks carry very small fractions of baryon momentum at low , 
the bulk of it is carried by valence-quarks and gluons. Thus we make the following ansatz: 
at an initial low value Qq, the valence-quarks carry a fraction / of baryon momentum 
and their probability distribution V{^) = ^\tjj{£,P)\'^ is determined by our approximate 
solution of two-dimensional QCD. 

Now, under evolution, the quark, anti-quark and gluon distributions mix, in gen- 
eral. However, the difference between quark and anti-quark distributions q^{S,,Q'^) = 
q{^, Q"^) — q{^, Q"^) evolves independently of the gluon distribution. This difference is re- 
ferred to as the "valence" quark distribution in the perturbative QCD literature. More- 
over, the experimentally measured structure function F^{xb,Q'^) — Q^ii^Q"^) where the 
Bjorken variable is approximately equal to the momentum fraction ^ up to correc- 

tions suppressed by the ratio of the proton target mass to Q . ~ ^ ( 1 ^r- ] US • 

Here we consider the isospin averaged valence-quark distribution, which corresponds to 

averaged over neutrino and anti- neutrino DIS on isoscalar targets. Finally, the rela- 
tion between the valence-quark distribution function and the valence-quark probability 
density is cj^{^,Q^) = v{Q'^)V{£^) where i^(Q^) is a normalization constant, the pertur- 
batively renormalized number of valence-quarks, given by the GLS sum rule jlB]- It is 
determined to high order since it is obtained by integrating the DGLAP equation from 
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= oo down to Qq 







TT vr ^ 12 3 ^ 



(3.135) 



l2 



where t = log -j^ — I , the number of flavors is Nr = 2 (up and down) and as{t) is the 

\^QCD J 

strong coupling constant. 

Within the rank-one approximation, the valence-quark wave function is determined 
in the large- limit, by minimizing the baryon mass (jT^ holding = 1 subject 

to the modified momentum sum rule 

N r p\i^{p)\''[dp\ = fP (3.136) 
Jo 

where / is the fraction of baryon momentum carried by valence-quarks. Our variational 
solution is tp{p) = C(-p)'*(l — p)^ where in the chiral limit, a = and b = We work 

in the chiral limit of massless current quarks since the up and down quarks are essentially 
massless (5 — 7 MeV) compared to the scale of strong interactions {Aqcd = 200 MeV). 

N —-2 

The valence-quark probability density is then V{^) = (y ~ 1)(1 ~ ^ where we set 
the number of colors to its physical value = 3. According to our ansatz, the valence 
distribution at Qo is q^iCQo) = i^{Qo)V{^). 

The two parameters of our model, Qq and / are to be determined by fitting to data. 
We expect Qq to be low, since we predict the anti-quark distribution to be small at 
Qq. And we expect / to be roughly a half since gluons and valence-quarks share baryon 
momentum roughly equally at low Qq. To compare with experimental data we evolve 
the valence-quark distribution from to to t > to according to the leading order DGLAP 
[l5] equation 



dq^{^,t) as{t) 

-^2 



dt 2tt 



- - ^ g^f,t)(i + 0-2C^(g,t) J, ■ 



C(i-C) 



We use the leading order DGLAP equation since we are evolving only over a small 
range of Qq<Q^< 13GeV^ Here C2 = '^2N^ ■ '^^^ evolution is done numerically by 
discretizing and t with increments of 6^, = .01 and 6t = .025 respectively. Since the 
DGLAP equation is linear, we may normalize q^iC^t) to v{t) at any value of t. But in 
practice we normalize it at to to reduce discretization errors in the region < ^ < SS, 
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XF3 at = 13 GeV^ 
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Figure 3.11: Comparison of predicted xF^ at = 13 GeV^ (solid curve) with measure- 
ments by CCFR (star) at 12.6 GeV^ and CDHS (diamond) at 12.05 < < 14.3 GeV^. 
Ql = 0.4 GeV2 and / = i. 

which contributes significantly to ij[t) since q^{^,t) diverges logarithmically for small ^ 
for t > to- The loss of knowledge of q^{^,t) for < ^ < 6^ for t > to does not affect 
the evolution for ^ > because information fiows from higher to lower values of ^ as is 
evident from the evolution equation. 

has been experimentally measured by the CCFR and CDHS collaborations TB" at 
a variety of values of and xb- The parameters /, Qq should be determined by a best 
fit to experimental data. For now, we assign to them the values / = ^ and Qq = 0.4 
GeV^ suggested by phenomenological fits to data |2H I76j . We may now compare our 
prediction with experimental measurements of F3 at higher values of Q^. In Fig l3.1ll 
we show a comparison with xF^ measurements by the CDHS and CCFR collaborations 
|16j at ~ 13 GeV^. We have ignored the difference between the Bjorken variable xb 
and the momentum fraction ^ which is small at ~ 13 GeV^. The plot shows that our 
model gives a zeroth order explanation of the data. 

Note: Too much should not be read into the values f = \ and Qq = 0.4 GeV^ we have 
picked for these parameters. In a more detailed comparison, and to further constrain 
the fit, one should substitute xbFs which has not been measured for xb > 0.7 with the 
structure function F2. The two are approximately equal for large momentum fractions, 
where the contributions of anti-quarks and gluons are small. It appears that our choice for 
/ and Qo produces a prediction that underestimates F^ for large momentum fractions. 
Increasing / makes the valence-quark distribution vanish slower as ^ — > 1, since the 
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2 

exponent is (1 — ^)~~ at = Qq- Merely changing / from 50% to 60% would change 
the exponent from 4 to 3 at = Qq- Increasing Qq also has a similar effect: smaller 
the range of over which the distributions are evolved, smaller is the momentum that 
flows from large to small values of ^. We do not discuss this any further. Better still, it 
should be possible to eliminate these fits to data and predict the values of both Qq and 
/. This requires a much deeper understanding of renormalization and gluon dynamics 
respectively. We now turn to the problem of understanding matrix-valued dynamical 
gluons, which we have ignored so far. We do this within the regularized context of 
matrix models. 
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Large- Matrix Models as Classical systems 

We now turn to x hermitian multi-matrix models, which are to gluons what 
A'^-component vector models are to quarks. Matrix field theories are much harder to deal 
with since there are of order A^^ rather than A^ degrees of freedom at each point in space- 
time. Non-perturbative techniques and the necessary mathematics to understand them 
are only just being developed. Understanding even a finite number of matrix degrees 
of freedom is a challenge, and there are many conceptually new phenomena to contend 
with. 

We will follow a strategy similar to that in Part ^ and study the large- A^ limit, 
which turns out to be a classical limit. The analogs of the color-singlet meson vari- 
able M ~ X^*(^)Xi(y)) which are unitary invariants of a collection of vectors, are the 
gluon Green's functions, the traces of products of matrices: cyclically symmetric tensors 
Gi = Gi^i2—i„. We find that the classical configuration space is the space of non- 
commutative probability distributions, whose coordinates are the gluon correlations. 
Non-commutative probability theory (introduced in ^ is the probability theory of 
operator-valued random variables, and is an outgrowth of the study of von Neumann 
algebras, especially due to D. Voiculescu [53]. The entropy from confinement of color 
degrees of freedom manifests itself in the large volume of matrices that correspond to 
the same set of correlations, a non-commutative entropy. 

In the euclidean formalism, matrix models are defined by an action 5*, which is usually 
the trace of a polynomial in the matrices. Classical equations of motion for the gluon 
correlation tensors are the factorized Schwinger-Dyson equations, also known as the 
loop equations. The solution of these classical equations is a point on the configuration 
space, a particular non-commutative probability distribution. We discover a classical 
action principle for these loop equations: maximize non-commutative entropy holding 
the correlations conjugate to the coupling tensors fixed. The main subtlety is that 
there is a cohomological obstruction to expressing the entropy in terms of the gluon 
correlations. To circumvent this, we express the configuration space as a coset space, 
taking inspiration from the Wess-Zumino-Witten model |in3j . The automorphism group 
of the tensor algebra acts as changes of variable on the space of probability distributions. 
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Indeed, the latter is a coset space of the former, by the isotropy subgroup fixing a given 
reference distribution. We express the entropy as a function on the automorphism- group 
invariant under the action of the isotropy subgroup. This allows us to get an explicit 
formula for the large-A^ classical action of euclidean matrix models. In other words, 
we express the correlations of a given matrix model in terms of those of a reference 
model and the change of variables that relates the two. This also leads to a variational 
approach to estimating the large- correlations and partition function. We parameterize 
the change of variable in terms of variational parameters, and use the extremization of 
the classical action to find their optimal values. In other words, given a submanifold 
of the configuration space parameterized by variational parameters, our action principle 
finds the point on this submanifold that best approximates the true correlations of a 
given matrix model. The variational estimate for the logarithm of the partition function 
is then the maximum value of entropy. 

Part O of this thesis, is based on our paper [S]. The reader is also referred to 
the related paper [2] and where we study the large- limit of hamiltonian matrix 
models as classical systems. Chapter ^ deals with some mathematical background on 
tensor algebras (free algebras) and non-commutative probability. Chapter El deals with 
euclidean large- matrix models as classical systems. Appendix El discusses the case of 
a single random matrix (one- matrix model). Appendix IHl summarizes some facts about 
the simplest of matrix models, the gaussian. 
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Chapter 4 

Free Algebras and Free 
Probability Theory 

In this chapter we will cover some mathematical background on tensor algebras, operator- 
valued random variables and the non-commutative version of probability theory that is 
relevant to their study. We will follow a point of view that is a synthesis of our own work, 
previous work in the physics literature [ZZlIIHllMlEaEHlllHlIZllElinni CHSl El EOl and 
that of Voiculescu [M] in the mathematics literature. 

A pair of real-valued random variables are statistically independent if their joint 
probability distribution factorizes as a product of individual probability distributions. 
Quantum mechanics can be considered as the ordinary (commutative) probability theory 
of real-valued random variables such as energy, momentum, position and spin; their ex- 
pectation values being given by averaging with respect to a weight specified by the wave 
function. If a pair of quantum systems are independent, then the total wave function is 
just the tensor product of the individual wave functions. Thus, the notion of statistical 
independence of classical probability theory is based on the tensor product. When the 
tensor product is replaced by the free product, the notion of statistical independence 
is replaced by the mathematical notion of freeness. (Note: freeness must not be con- 
fused with free field theory in physics!) The resulting probability theory is called free 
probability theory (developed by Voiculescu [31] )• Of course, not all random variables 
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in free probability theory need be free of each other, just as not all real-valued random 
variables are statistically independent. Real-valued random variables commute and are 
elements of commutative algebras such as the algebra of measurable functions on R^. 
The random variables in free probability theory are no longer real-valued, they do not 
commute in general and are elements of non-abelian algebras of "measurable operators" , 
also known as W* algebras or von Neumann algebras. Thus the study of von Neumann 
algebras is just a non-abelian extension of measure theory. Free probability theory is 
also referred to as non-commutative probability theory. Somewhat counter intuitively, 
the simplest examples of non-commutative probability theory occur when the algebra of 
random variables is maximally non-abelian, i.e. the basic operators (generators) satisfy 
no relations. Then the operator algebra is just a free product of the operator algebra for 
each of the generators. Examples are (1) the free algebra on hermitian generators (full 
tensor algebra) which is associated to hermitian large-A^ matrix models; (2) the group 
algebra of the free group (the algebra of convolution operators on the free group) which 
has to do with large-A^ unitary matrix models; and (3) the Cuntz and Toeplitz alge- 
bras of creation-annihilation operators satisfying the Cuntz relations having to do with 
quantum matrix models in the canonical formalism |72j or the master field formalism for 
matrix models l3f H l99] . 

The three examples are related. For example, the information in the connected 
moments (cumulants) of the tensors in the free algebra can also be packaged as the 
vacuum expectation values of so-called master fields which are thought of as elements of 
the Cuntz algebra. Similarly, the probability theory on (1) and (2) are related by a kind 
of exponentiation. We will follow an approach based on example (1), since it is most 
closely tied to the hermitian matrix models studied in this thesis. 

4.1 Tensor Algebras and Operator- valued Random Vari- 
ables 

In the usual commutative probability theory of M real- valued random variables xi - ■ ■ xm 
we are concerned with determining the expectation values of observables which are func- 
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tions of the random variables: (/(xi, • • • , xm))- These expectation values are determined 
by averaging with respect to a joint probability distribution function (pdf or measure) 
p{xi, ■ ■ ■ ,xm) on R^^. The expectation values of functions that can be written as power 
series in the Xi can be built up from the expectation values of monomials in the random 
variables: the moments Gi^...i^ = (xj^ ' ' ' ^in) which are symmetric tensors since real- 
valued random variables Xi commute. Thus the algebra of observables of commutative 
probability theory can be taken to be the commutative algebra of polynomials (more 
generally formal power series) in M indeterminates. In solving the classical moment 
problem, nineteenth century mathematicians showed that it is possible to reconstruct 
the joint pdf from a knowledge of the moments, under mild hypotheses j7j- Indeed, 
one can formulate almost the entire theory in terms of moments. (But moments do not 
tell the whole story. They are not adequate to compute expectation values of functions 
non-analytic in the random variables, for instance. In fact, one of the most important 
quantities in probability theory, the Boltzmann-Shannon entropy of a probability dis- 
tribution — / p(x) log /9(x)(ix cannot be expressed in terms of the moments alone.) For 
instance, the condition on the positivity of a probability measure /5(x) > translates into 
the moment inequalities: for M = 1, they are the conditions that the expectation value 
of the square of any polynomial f{x) = J2n fnX^ must be positive, J2m,n fmfnGm+n > 0. 
This is the SciniG cLS tliG positivity of the Hcinkel nicitrix Grn,n 

:= Gm+n- Similarly, the 

normalization constraint (1) = 1 becomes G0 = 1 where denotes the absence of any 
indices. 

Now suppose ^1, • • • ,^A/ are M operator- valued random variables. We take them to 
be hermitian. We can think of the as hermitian matrices. What is the analogue of a 
joint probability distribution? If there was just a single random hermitian operator, the 
distribution of its eigenvalues may be regarded as the probability distribution. But when 
we have two or more operators which do not commute, they cannot be simultaneously 
diagonalized. In this situation, the moments come to the rescue and serve to define the 
joint probability distribution. 

First, what is the algebra of observables? We shall consider random variables which 
are polynomials (or more generally formal power series) in the basic ones (the generators) 
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f{^,)=fn-...^^^...^^^^fl^^ (4.1) 

where ^/ '■= ih---in •= " " " ii„ and repeated indices are summed. Thus random variables 
are determined by their complex- valued coefficient tensors Just hke functions, these 
formal power series may be multiplied by complex scalars, added and multiplied "point- 
wise" 

= [/ + 9]' = + [fg]' = 5ijj'^g'^ (4.2) 

where iJ^^^^ = 1 if I1I2 = I and vanishes otherwise. The multiplication is the associative 
direct product of tensors. If we think of the string / as a path in the space of indices, then 
this is the product induced by concatenation of paths. The simplest situation is the one 
in which the satisfy no relations, this can be modelled by x hermitian matrices 
in the limit of large- A^, where the constraints imposed by the vanishing of characteristic 
polynomials may be ignored. In this case, the algebra of observables is the full tensor 
algebra or the free algebra Tm on M generators. Note that the tensors are not required 
to satisfy any symmetry property. TJv/ is the free product of M copies of Ti 

Ta/ = Ti * • • • * Ti (4.3) 

The free product of a pair of algebras B is the linear span of all products of the 
form 01610262 ■ ■ ■ o,nbn where ai £ A and bi £ B are arbitrary. The multiplication is the 
concatenation product given above. Other associative algebras on hermitian generators 
can be obtained from this free algebra by quotienting by the ideal generated by the 
relations satisfied by the generators. 

4.2 Non-Commutative Probability Distributions 

Now, a non-commutative probability distribution is a rule to calculate the expectation 
values of the random variables in the algebra of observables. They are determined by 
the moment tensors 

Gn...i„ = {Cn---U (4.4) 
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By analogy with commutative probability theory, we define the non-commutative joint 
probability distribution to be a sequence of moment tensors Gq, Gij, Gi^^i^, • • • satisfying 
the normalization, hermiticity and positivity conditions 

G0 = 1; G*i = Gr, Gijf*f>0 (4.5) 

for all polynomials f{^) = f^£,i. The positivity condition can be regarded as the positiv- 
ity of the Hankel matrix Gj-j := Gjj. If I = ii ■ ■ ■ in, I = inin-i The expectation 
value of an arbitrary polynomial is then 

ifiO) = fGi (4.6) 

In the case of relevance to matrix models (pure gluondynamics or closed string theory), 
the moments are cyclically symmetric tensors since they are modelled by the expectation 
values of traces of products of matrices. The space of such moment tensors is the space 
of non-commutative probability distributions in M generators, Vm- More generally, if 
we were to include the quark degrees of freedom along with the gluons, we would have 
to deal with open strings of gluons with quarks at the ends H1.4() . The corresponding 
moments would not be cyclically symmetric. 

Hermitian large- matrix models provide examples of non-commutative probability 
distributions. The moments are given by integrals over large hermitian matrices (see 

Gi= lim - f dAe-'^'^^^-^^-Aj (4.7) 

More generally, Voiculescu defines a non-commutative probability space as an asso- 
ciative algebra A over the complex numbers with unit element 1 and linear functional 
(or "state" in quantum statistical mechanics |3H]) t such that r(l) = 1. The elements 
of the algebra are the random variables. He adds probabilistic content to this algebraic 
definition by requiring the algebra to be a weakly closed operator algebra with the ad- 
joint operation a i-^ a*, (a*)* = a {W* algebra or von Neumann algebra). The state 
must be real T(a*) = t*(o) and positive elements must have positive expectation values 
r(a*a) > 0. The case of cyclically symmetric moment tensors above corresponds to a 
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"tracial" state r(a6) = T[ba). Other examples of operator algebras that admit a nor- 
malizable tracial state r include Type I algebras: finite dimensional matrix algebras and 
finite type II factor algebras such as the von Neumann algebra of operators in the left 
regular representation of a group with infinite conjugacy classes (e.g. group algebras of 
free groups). 

The precise definition of freeness can be given within the algebraic context. A family 
of subalgebras Ai of a non-commutative probability space are a free family if 

T(ai • • • a„) = (4.8) 

whenever aj G Ai- with ij ^ ijj^i for 1 < j < n and T{aj) = for 1 < j < n. In other 
words, a family of subalgebras are free of each other if the expectation value of a product 
of random variables vanishes if they each have zero mean and adjacent random variables 
are from distinct subalgebras. Random variables are free of each other if they are from 
subalgebras that are free of each other. 

For example, consider a matrix model with action S{Ai, A2) = Si{Ai)+S2{A2) ■ Then 
random variables Ai and A2 are free of each other in the large- limit. In commutative 
probability theory, the joint pdfs of a pair of statistically independent random variables 
is determined by their individual pdfs. Similarly, the freeness property can be used to 
find all the mixed moments such as G1122 once the pure moments Gu, G22 etc are known: 

= ((66 - (eiei))(e2e2 - (66))) 

=^ G1122 — ^11^22 — G11G22 + G11G22 = 

=^ ^1122 = ^11^22 (4.9) 

The most famous of non-commutative probability distributions is the Wigner distribu- 
tion. We discuss it in Appendix lO 

4.3 Automorphisms and Derivations of Tensor Algebras 

We will be interested in how one non-commutative probability distribution can be trans- 
formed into another. We can use the group of effective transformations as a way to 
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parameterize the space of probability distributions. This group plays a key role in the 
theory to be developed in ^ 

Let us consider transformations (p of the algebra of observables that preserve its 
structure (j){fg) = 4>{f)(t)[g). These are the automorphisms of the tensor algebra Tm 

ii ^ mi = ^ki (4.10) 

provided they are invertible. The composition of a pair of automorphisms is 

[{4^ o </,)]/ = • • • C (4.11) 

The inverse, = i^iiOi determined by the conditions 

[{^|; o <^),]^ = 5f,...p„^f -^"^J ■■■cpf:= 6f. (4.12) 
They can be solved recursively for tpy. 

^1.-.. = -E •••Vt^^.-U<^^---<^k.- (4-13) 

A necessary condition for an automorphism to be invertible is that its linear part (/>J- be 
invertible: det 0*- ^ 0. Even if the linear part is invertible, the above recursive procedure 
may not converge in a finite number of steps. Indeed, we would not be able to find 
inverses of polynomials within the world of polynomials. This forces us to work with 
the "completed" tensor algebra of formal power series in the generators ^j, rather than 
just non-commutative polynomials. This is why we allowed our random variables to be 
formal power series in the basic ones. (Note that the composition rule given above makes 
sense for two formal power series since the operations involve only finite sums.) However, 
we pay a price for using formal power series instead of polynomials for our algebra of 
observables. It may not be possible to evaluate the expectation value of an arbitrary 
formal power series in the generators. However, it turns out that in the large-A^ limit, 
many of the interesting probability distributions are in a sense compactly supported so 
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that moments do not grow too fast (see for example the Wigner semi-circular distribution 
given in AppendixEl an even stronger restriction on the growth of moments is the Wilson 
area law conjecture (Section ^1.2|) ') and it is possible to evaluate the expectation value 
of many interesting observables. 

We will further restrict to the "orientation preserving" automorphisms, det > 0. 
Thus, the automorphisms of the tensor algebra form a group Gm = Aut Tjv/. For M = 1 
it is the diffeomorphism group of the real line, where the real line is thought of in terms 
of the commutative algebra of formal power series Ti on it. 

gi = {^(x) = £ 01 > o| (4.14) 

n=l ^ 

with the composition law 



fc=l \-lk=n 



and inverses given by Lagrange's recursion relations 

-j^ n—l 

(f)-'^=lP- il;^(f,^ = l- ^„ = _ ^-0^ 4)1^- ■ ■ 4)1^^. (4.16) 

^1 fc=l hifc=n 

For M > 1 the free algebra Tm is non-abelian and can at best be considered as the 
algebra of functions on a non-commutative manifold. Then Qm is a non-commutative 
analogue of the diffeomorphism group. 

Finally, the action of the automorphism group on probability distributions is given 
by the transformation rule for moments. If F/ = (^/) and ^ 0(0* > then 

Gi = mh ■ ■ ■ m^J = [4>*r]i,..,.^ = ct>ii ■ ■ ■ ^t^j.-j^ (4-17) 

However, only a subgroup of the automorphism group preserves the positivity of proba- 
bility distributions 

Gm = G QM\[4>*r]ijf*f-^ > V polynomials f{0} (4.18) 

Let us now characterize the infinitesimal version ol Qm, the Lie algebra Qm- 

An automorphism (j){A)i = Ai + v( Aj that deviates infinitesimally from the identity 
is a vector field, a derivation of the tensor algebra. Such a derivation can be expressed 
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as a linear combination of the basis vector fields Ly = v( where the action of the basis 
vector fields is 

m]i = (4.19) 

and by the chain rule, 

LKj = S'/''UiJ, (4.20) 
Taking expectation values, we get the action on moments 

[L]G]j = 6'/^'^UiJ, 

L\ = Gj.jj,—^ (4.21) 

The Lie algebra Qm of these vector fields can be worked out 

[L}, LS] = 6'/^'^L^j^,j^ - sY^'^Vj^n, (4.22) 

For M = 1 it reduces to the Lie algebra of polynomial vector fields on the real line. Con- 
sider the infinitesimal changes of variable 4>{x) = x + ex^^^, A; = 0, 1, 2 . . .. (translations, 
corresponding to /c = — 1 are not allowed since they are not invertible.) Thus a basis for 
the Lie algebra is 

Lfc = x''^^^ = x^D, k = l,2... (4.23) 
ox 

and they satisfy the commutation relations of the Virasoro algebra [Lp, Lg] = (q—p)Lp^q 
(see also Appendix IB. 1(1 . So far, we have been thinking of the Lie algebra as acting on 
the tensor algebra. We can also think of the Lie algebra as acting on the group itself. 
By taking either ^ or (/> to be infinitesimal {x + ex'''^^) in the composition law tpocp for a 
pair of group elements, we get the left and right actions of the Lie algebra on the group 
Gi. The left action is iC''(p){x) = (t)^{x): 

C}K= E ^h---K- (4.24) 

ilH h/fc=n, 

For a physical application of this see HB.11|) . The right action is: T^^{x) = x^D(^(x). 
They both satisfy the same commutation relations as the L^,. 

The reason we studied how probability distributions transform under a change of 
variable is that it gives us a novel way of understanding the entropy of non-commutative 



Chapter]^ Free Algebras <fe Free Probability 



89 



probability theory. It is also essential to understand the classical action principle of 
large- matrix models (see ^5.2|) . 

4.4 Free Entropy of Non-Commutative Probability Theory 

In ^5.31 entropy of operator- valued random variables is discussed from the point of view 
of a variational principle for matrix models and the cohomology of the automorphism 
group of the free algebra. Here, we present Voiculescu's precise definition JW! in terms of 
matricial microstates. The free entropy x of a collection of self adjoint random variables 
Xi, • • • , Xn in a W* probability space {A, r) is the logarithm of volume of a set of her- 
mitian N x N matrices Ai, the matricial microstates. The set of matricial approximants 
are restricted by requiring all their N ^ oo moments to equal those of the random 
variables Xi. The volume measure on the space of hermitian matrices is the Lebesgue 
measure A corresponding to the Hilbert-Schmidt (HS) norm J tr{Al + --- + Al). Let 
r/j(Xi, • • • , Xn',m, N, e) be the set of n-tuples {Ai, ■ ■ ■ , An) with 

\^Aj-t{Xi)\ < e V|/|<m (4.25) 
Successively, the regulators are removed iV^oo,m^oo,e— >0, i?, ^oo 
XRiXi,- ■ ■ ,Xn;m,N,e) = logX{Tji{Xi,- ■ ■ ,Xn;m,N,e)) 

1 Tl 

XniXi,- ■ ■ ,Xn;m,e) = \imsup{—jXRiXir ■ ■ ,Xn;rn,N,e) + -\ogN) 

N~,oo A 2 

Xr{Xi,- ■ ■ ,Xn) = mi{xRiXi,- ■ ■ ,Xn;m,e)\m £ Af,e > 0} 

x{Xi,---,Xn) = snpxR{Xi,---,Xn) (4.26) 
ii:>o 

The cut off R plays a minor role, XR is independent of R once R exceeds the HS norms 
of all the Xi. Some properties of free entropy x that Voiculescu establishes are listed 
below. See jHS] for additional results. 

1. The entropy of a single semi-circular variable S with t(S'^) = 1 is xi^) = ^ log27r. 

2. x{Xi, • • • , X„) < f log (^) where = T{J2i=i Xf). The RHS is the entropy of 
the standard Wigner distribution. 
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3. n = 1: If X has distribution /i, then x{^) is the logarithmic energy 

X{X) = J log \s - t\d^l{s)d^l{t) + I + i log 2^ (4.27) 

4. Semi-circular maximum: If T{Xf) = 1 V i then xi^i , • ' " > is maximal iff the 
Xi are semi-circular and freely independent. Thus the semi-circular distribution 
plays the role of the gaussian of ordinary probability theory. 

5. Additivity under Preeness: If Xi are pairwise freely independent, 

n 

xiXi,---,Xn) = J2x{Xi) (4.28) 

i=l 

This is the analog of the additivity of entropy of statistically independent random 
variables in usual probability theory. Statistical independence is replaced by free- 
ness here. This explains the choice of the logarithm of the volume rather than any 
other function of volume in defining entropy. 
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Chapter 5 



Euclidean Large- Matrix Models 



We will consider matrix models, whose random variables are a collection of AI N x N 
hermitian matrices, Ai, i = 1, . . . ,M. (Appendix El contains a rapid summary of some 
facts about the M = 1 matrix model.) We can think of them as the gluon field at a finite 
number of points in space-time labelled by i. The i could also label links of a lattice as 
in lattice gauge theory, in which case it is more convenient to consider unitary matrices. 
The matrix elements are distributed according to 

-TV tr 5(A) dA 

z{s) ^^-^^ 

dA stands for the usual Lebesgue measure on the matrix elements of the M hermitian 
matrices. The theory is defined by the action S{A), which we take to be a polynomial 

S{A) = J2 S''-'"A, ■■■A^ = S'Ai (5.2) 

n 

with cyclically symmetric tensors , the coupling constants. (Note: We will use capital 
letters to denote strings of indices Ai^ ■ ■ ■ Ai^ = ^i^.-.i^ = Af and sum repeated indices.) 
The partition function is 

Z{S) = j e-''''^^'^UA (5.3) 
and the free energy is defined as 

F{S) = -^\ogZ{S) (5.4) 
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The expectation value of any function of the matrices is 

if (A)) = zls)J ^(^^"""^ ""^^^^^ ^^-^^ 

The observables we are interested in are those that are invariant under the simultaneous 
action of U{N), Ai UAiU\ i = 1, . . . , M. We can think of these as the global gauge 
transformations that remain after local gauge fixing. The simplest examples of invariants 
are = ^Ai known as loop variables. The factor of is inserted with hindsight to 
ensure a good large- limit. Also of interest, are the Wilson loop-like observables 77 e"^^, 
though these can be built out of the loop variables by expanding the exponential in a 
power series. 

5.1 Loop Equations, Classical Configuration Space 

The loop variables satisfy Schwinger-Dyson equations 

S'^''H^j,ij,) = 5\^''^{^i,^i,) (5.6) 

which are the conditions for the invariance of the partition function under an infinitesimal 
but nonlinear change of integration variable Ai ^ Ai + vlAi. In other words, under the 
action of the vector field = vlL\. The basis vector fields act as L\Aj = SjAj and by 
the Leibnitz rule, L}Aj = dj^^^^Aj^jj^. There are contributions from both the change 
in the action and the change in the measure. 

1 - Nv{ tr L'jSiA) 

= j dA{^ + tr Aj^ tr Aj^^ (^1 - Nv{S-^''^^ tr Aj^u^ e'^ ^%.7) 

where we have used det (1 + vT) ~ 1 -1- v tr T. Comparing coefficients of and dividing 
by Z{S)N'^ gives the advertised Schwinger Dyson equations. The latter are not a closed 
system of equations for the expectation values of the loop variables. They are related 
to those of a product of two loop variables. Migdal and Makeenko |^ noticed that in 
the large-A^ limit of 't Hooft [Ml , holding the coupling constants fixed, the loop 
variables satisfy a closed system of equations, known as the loop equations or factorized 



Z{S) 



dA 



det 



d{Ai + viAi)l 
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-N tr S{A) 
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Schwinger-Dyson equations. This is because, in the large- hmit, the loop variables do 
not have any fluctuations, and their expectation values factorize 

(|^4^/.> = (|^/.)(^^/.> + 0(^) (5.8) 

This property can be proven perturbatively. Thus the large-iV limit is a classical limit 
and it suffices to restrict attention to the gluon correlations or moments 

Gj = hm (1 tr Aj) (5.9) 

which are cyclically symmetric tensors. In terms of them, the factorized Schwinger-Dyson 
equations are 

S'^'''^'Gj,ij, =rfj = 5\''''Gi,Gi,. (5.10) 

What is the classical configuration space? It is the space of allowed moments. From 
the definition of Z{S) we see that G0 =< ^1 >= 1. Hermiticity of Ai implies G*j = Gj, 
where I = in ■ ■ ■ ii ii I = ii ■ ■ ■ in- Moreover, if f{A) = f^Aj is a polynomial, then the 
expectation value of its absolute square should be positive, f"^ (A) f {A)) > provided 
the integrals converge. In other words, Gjjf^* f '^ > which is the same as the positivity 
of the Hankel matrix Gj-j = Gjj. This imposes certain inequalities on the moments, 
which are known as the moment inequalities. We see that the configuration space of 
large- matrix models is the space of non-commutative probability distributions in M 
generators Vm, as defined in H4.21 The moments Gj are coordinates on this configuration 
manifold. Thus the moments of large-A^ matrix models provide an example of non- 
commutative probability distributions. If we associate with each matrix Ai a generator 
of the free algebra ^j, then Gj =< >. 

In commutative probability theory of real- valued random variables, the moments 
Gk = Jx^p{x)dx determine the probability distribution p{x) up to technicalities 0. 
But there could still be observables whose expectation values are not easily expressed in 
terms of the moments; an example is entropy / p{x) log p{x)dx. In matrix models, the 
large- limit of the free energy F = limTv^oo —^^ogZ is of much physical interest. 
Though in principle it is determined by the moments, we will see that there is no simple 
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formula for it in terms of the moments. Indeed, this subtlety is quite central to our entire 
discussion and we will return to it. 

The loop equations (jS-lUf) are quadratically non-linear in the moments and in general 
couple all the moments to each other. The LHS, S'^'^^^'^Gj^ij^ comes from the expectation 
value of the variation of the action S{A). The RHS 77} is an anomaly in the sense that 
it comes from the change in integration measure ISHl- It is a universal term since it is 
independent of the coupling constants . We can regard these as the classical equations 
of motion. For a single matrix (M = 1) these equations are recursion relations and 
can be solved (see Appendix^. There are a few special choices of action, such as the 
gaussian, where S is a quadratic polynomial (see Appendix[Oand Ref. 1751 1^ I5T] ^ 
for which the partition function or some special classes of moments can be obtained 
exactly. But in general, even after the simplifications of the large-A^ limit, the loop 
equations of a multi-matrix model have not been solved exactly. The situation is not out 
of the ordinary, since the generic classical system is not exactly solvable, it is understood 
by approximation methods such as perturbation theory, mean-field theory, variational 
principles etc. Perturbation theory around the gaussian leads to the expansion in terms 
of planar Feynman diagrams |^. In the case of finite matrix models, the perturbative 
series is expected to have a finite radius of convergence in the coupling constant and 
one might hope for a purely perturbative construction of the theory 52_ . But there are 
limitations to this approach due to the renormalon singularities upon passage to full- 
fledged field theories Moreover, as we saw in two-dimensional QCD, there are many 
interesting phenomena that are either inaccessible to perturbation theory or more easily 
understood by other methods. Understanding the large-A^ limit of matrix models as 
bona-fide classical mechanical systems will permit us to adapt the methods of classical 
mechanics to this new class of dynamical systems. 
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5.2 Classical Action Principle 
5.2.1 Anomaly as a Cohomology Class 

One peculiar feature of our derivation of the classical equations of motion is that we got 
them sans any knowledge of the classical action! The original action S{A) will not do 
since its variation will not give the anomalous term r/} which is independent of coupling 
constants and came from a change in the measure. We look for a classical action Q{G), 
the conditions for whose stationarity under the vector fields L} are the loop equations 

Vjn{G) = -S'^^'^'Gj.ij, + 6'/''^Gi,Gi, (5.11) 

The action of these vector fields on moments is given by [UjG]j = 6j^^'^^Gj-^jj^. The Lie 
algebra (which we denote Gm) of these vector fields is (see @ 

[L], L^j] = Sj^^-'^L^j^.j^ - 6'/^'^L}^jj^ (5.12) 

Equivalently, we can think of them as first order differential operators on the configura- 
tion space 

L} = Gj,jj,—^ (5.13) 

The action dependent term is clearly the variation of {jj times) the expectation value of 
the original action S{A) 

L\{S'Gj) = S'^''^Gj,u,. (5.14) 
So let VL{G) = -S^Gj + x{G) where 

ViX{G)=rtj = 5\^''^Gi,Gi, (5.15) 

This is a system of inhomogeneous first order linear PDEs on the configuration space. 
Necessary integrability conditions (though not sufficient) for a solution x{G) are: 

L}r?S - m - 5}'H,IJ, + ^\''H,JI, = 0. (5.16) 

These complicated looking conditions simply state that r/} are the components of a closed 
one-form. They look complicated because the L\ are a non-commuting basis for vector 



Chapter\^ Euclidean Large-N Matrix Models 



96 



fields unlike the coordinate basis g^-j- Define the 1-form r] such that evaluated on the 
vector field L} it gives r][L\) = r/}. Then the condition for rj to be closed {drj = 0) is 

(dr?)(L}, LS) = L}r?(LS) - L^ML]) - ry([L}, L^]) = 0. (5.17) 

When written in components we get ()5.16|1 . We show in Appendix^that the integrability 
conditions are satisfied. So r/ is indeed a closed 1-form! 

In the same vein, (|5.15j) becomes r] = dx- Is r/ an exact one-form? The answer to 
a question on exactness depends on the class of functions on Vm from which x{G) can 
be chosen. We could not find any formal power series in the moments whose exterior 
derivative is r]. Formal power series are natural since the other term in the classical 
action (S^Gj) is such a function. To see why x{G) is not a power series in the moments, 
it suffices to consider the one-matrix (M = 1) case where x = / log \x — y\p{x)p{y)dxdy 
cannot be expressed as a series in the moments Gk = J x^p{x) dx on account of the 
non-analyticity of the logarithm at the origin (see Appendix Ib)). Unfortunately, there 
is no straightforward generalization of this one- matrix formula for to M > 1. The 
notion of a probability density p(x) makes sense as the density of eigenvalues of a single 
matrix. But when we have several matrices that do not commute, there is no meaning 
to their eigenvalues since there is no common basis in which they are all diagonal. The 
basis independent notion of a joint spectrum is then the collection of moments Gj. 

We conclude that the anomaly r/ is a closed but not exact one-form, in other words 
an element of the first cohomology of the Lie algebra of vector fields Qm valued in the 
space of power series in the moments. But t] is just the infinitesimal version of xiG) 
iv = dx)- It is therefore natural to expect xiG) to be an element of the first cohomology 
of the group Q whose Lie algebra is Gm_. Now Qm is spanned by the vector fields L} 
which are just infinitesimal automorphisms of the tensor algebra on M generators (or 
free algebra) Tm (see Therefore Gm must be the group of automorphisms of the 
tensor algebra AutT/vf. 
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5.2.2 Configuration Space P as a Coset Space Q/SQ 

More concretely, to find a formula for x{G), we must find some other way of describing 
functions on the configuration space Vm^ since power series in Gj were inadequate. To 
do this it helped us recall that ry was the change in the matrix integration measure under 
an infinitesimal change of variable. This suggests another way of parameterizing Vm 
- in terms of the finite (invertible) change of variables (j) that maps the moments of a 
reference probability distribution Tj to those of the probability distribution of interest 
Gi = [(/>*r]/. Moreover, x{G) should be the change in measure under a finite change of 
variables. 

But such a change of variables is nothing but an automorphism of the free algebra 
Ta/ ! We see that the configuration space Vai carries an action of Gm = Aut Tm and this 
action is transitive if we restrict to those distributions which can be obtained by such a 
change of variable starting from T. However, there may be more than one automorphism 
that maps F to G. Suppose SQm is the isotropy subgroup for this action, the subgroup of 
measure-preserving automorphisms which map T to itself. Then the space of probability 
distributions Vai is the coset space Gm/SQm- 

Returning to xiG), how does this coset space description of the configuration space 
get around the cohomological obstruction to expressing xiG) as a power series in the 
moments? We have a new way of thinking of functions on Vm'- functions on Qm invariant 
under the action oISQm^- In particular, power series in the Gj can always be expressed in 
terms of 0f by substituting G/ = [0*r]/. But there are power series in the 4)1 (functions 
on Qm) invariant under SQm which cannot be expressed as power series in the Gj. 
Indeed, we find that x{G) is one such function! 

Let us explain our implementation of the rough ideas outlined in the last few para- 
graphs. Suppose (/) is a transformation of the matrices 

[A,]i ^ = <^r-'"K]a. • • • [Ajr' (5.18) 

which is the finite version of the infinitesimal transformation Ai Ai + v( Aj associated 
with the vector fields Lj (see ^5.1|1 . (p defines an automorphism of the free algebra 

(5.19) 
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provided its linear part (p^ is an invertible M x M matrix. Let Tj be the moments for 
the standard action So{A) and Gi those for action S{A). Suppose they are related by 
the change of variables So{A) = S{(f){A)). Under such an automorphism, the moments 
and free energy F{S) = — lim^v^oo 777 log Z{S) transform as 

Gi,...i„ = [(A*r]i,...i„ = • • • </'/:rj,...j„ 

FiS) = F(5o) + lim < --^ tr log J >5o (5.20) 

where J = is the jacobian matrix for the transformation and O^o denotes expec- 

tation value with respect to the action Sq. To see this, consider 

Gi = lirn^y dyle-^*'^^(-^)^A,|/| J dAe"^ ^(^) | (5.21) 

Make the change of variables Ai = (l)i{A) 

^ ^ 1-^ /didet(J)e-^^-^°(-^)|0.,(i)-->.„(i) 
^ /didet(J)e-^t=-^o{i) 

j.^ JdAe-^'^'o(^)^cP,,M)...<p,„iA) 
N^oo J dAe-^ ^oiA) 



rj,---rj„ (5.22) 



where we have used the factorization of correlations in the large- limit to cancel the 
jacobian determinants. Similarly, 



N tr 5o(A) 



Z{S) = j dAe-^''^'^^^ = JdAdet{J)e- 

^log{Z{S)/Z{So)} = ^log<e*^'°s(^)>5o 

F{S) - F{So) = - lim < ^ tr log J >5o (5.23) 

where in the last step we have used large- factorization to reverse the order of the 
logarithm and expectation value. 

However, not all automorphisms preserve the positivity of the Hankel matrix Tj-j = 
Tjj. The subset that do is ^a/ = S Gm \ [4>*'^]iii2 ^ positive matrix | We will largely 
ignore the somewhat technical difference between G and Q. (Does Gm have the same Lie 
algebra as ^m?) 

Now we will think of any probability distribution of interest, Gj as being obtained 
from a fixed reference distribution Tj by the action of a change of variable (p G Gm- 
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Can every probability distribution be obtained in such a manner? Here we show that 
any distribution in the infinitesimal neighborhood of a given Ti can be obtained by an 
infinitesimal automorphism, a derivation v = v(Uj. To see this, let Gj = F/ + STj 
be a distribution that differs infinitesimally from our reference Tj. Suppose T satisfies 
the strict moment inequalities, i.e. F/.j = Tjj is a positive Hankel matrix so that 
it is invertible T^'^'^Tj-k = ^k- ^"^^ cyclically symmetric tensors (preserving the 

moment inequalities). The question is, can we find a vector field = vjUj such that 
[L^F]/ = (5F/? Let I = ii ■ ■ ■ in- Then 

— Vj^X ji2---i„ + v^^Ti^ji^...i^ + • • • 7;j^Fjj...j^_-^ J 

= vlTji2...i„ + viT ji^...^^^, + ■■■ + <Fj,,...i_, (5.24) 

Given (5F, pick any tensor w whose cyclic part is (5F, 

Wii-i„ + Wi2-iuh H Wi^h-i^.i = 5^h-i„ (5.25) 

and put wj = vfVjj. Since F/.j is positive, we can invert it and solve for v:^ 

^vf = w;/F^'^ (5.26) 

It follows that [Lt,F]/ = STj. Thus, we have found an infinitesimal change of variable 
V which allows us to transform from our reference F to any given distribution in its 
infinitesimal neighborhood. In other words Gm acts transitively on the tangent space to 
the classical configuration space at F. 

We have not investigated whether this action of the Lie algebra can be exponentiated 
to a transitive action of the automorphism group on Vm- Though it seems likely, the 
necessary change of variable may turn out to be singular if the probability distribution 
G is qualitatively very different from F, this may be the sign of a "phase transition". In 
that case we will need one reference distribution for each "universality class" of matrix 
models. 
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What is more, for M > 1 there are many changes of variable that transform T into G. 
They differ by those automorphisms that fix F, the isotropy subgroup of measure preserv- 
ing automorphisms SQ. Infinitesimally, the isotropy Lie algebra consists of derivations 
V that leave T invariant 

[L^r]i = for ah I 
=> = vlTji,...i„+vlTj,^...i^i, + --- + viTM,...i^_^ (5.27) 

A somewhat more explicit characterization of this isotropy Lie algebra when the reference 
r is the multi-variate Wigner distribution is given in Appendix O 

The upshot is, we can regard the configuration space of large- matrix models as a 
coset space of the automorphism group of the free algebra: Vm = Gm/SQm- 

5.2.3 Finite Change of Variable: Formula for x QjSQ 

We now use the coset space characterization of the configuration space to solve for x{G) 
such that UjxiG) = rfj. Suppose L is a reference probability distribution and (/> is a 
change of variable that maps T to the distribution G. We shall express 

x(G')-x(r) = c((/>,r) (5.28) 

where c((/), F) is a function on the group Q. But the LHS is independent of the "path" 
(j) that connects T to G. So to make this a sensible equation, we show that c{(j)^ T) is 
invariant under the action of the isotropy subgroup SQ. Then it will really be a function 
on the quotient V = Q /SQ, which is the same as the space of moments. 

Recall f H5.2.1() that r] is an element of the first cohomology of the Lie algebra Q_ 
valued in the space of power series in the moments. By naturality, x{G) — x(r) must 
be in the first cohomology of the group Q, valued in the space of formal power series in 
the moments (See Appendix El on group and Lie algebra cohomology). In other words 
c((/), r) must be a non-trivial one cocycle. The analogue of dr/ = is the cocycle condition 
in group cohomology. Let (j) and ij) be two automorphisms. The cocycle condition is 



c((/)V',r) = c(</.,V',(r)) + c(V',r) 



(5.29) 
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Our answer for c{(j), T) must satisfy this transformation rule. 

Recall that r] is the expectation value of the jacobian for an infinitesimal change of 
variable (/'i(C) = Ci + Since rj = L\{x{G) — x(r)) is the infinitesimal version of x we 
guess that c((/), T) is the expectation value of the logarithm of the jacobian determinant 
for a finite change of variable cf). Why logarithm? For, the addition rule for the logarithm 
of a product will translate into the cocycle condition. 

The jacobian matrix for a change of variables (plAj ()5.18() . is J(0, A) = ^^§^- Under 
two successive changes of variable, the jacobians multiply 

J((/>V, A) = J{ct>, V'(A)) J(V', A) (5.30) 

Then it((^, A) = -^logdet J{(p, A) satisfies a{(j)'ip,A) = a{(j),ip{A)) +a{tl;,A). Taking 
the expectation value with respect to the reference distribution T we define 

c(4>, r) =< aU, A) >=< 4tt log det J > (5.31) 

and the cocycle condition (|5.29j) is automatically satisfied. 

We will get a formula for c{(p,T) as a power series in (pf and Tj. The simplest 
possibility is a linear change of variable, [(/)(A)]j = [0i]^j4j. In this case, a{(j),J) = 
log det 01. Since (p is invertible, this term is always non- vanishing, and we factor it out 

l^l>2 



We show in Appendix El that a{(p, A) can be written in terms of the loop variables 

tr 



oo 



a{<P,A) = logdet<Ai + E ^^'^f^*^^^'^.?^^^^ • ■■4>Z-''''"^K,-K^^L„-L,. (5.33) 



n=l 

Taking expectation values and using factorization in the large- limit 

n 



c(0,r) = logdet<^i + E ^-^4>f,''''''4>Z'''''' ■ ■ ■ ^t''''-VK,...KjL,.-u. (5.34) 

n=l 

Now let (f) be an infinitesimal automorphism, (^{{A) = Ai + v^Aj where are small. 
Then [(t)i]j = + vj, 4>i = vj and only the n = 0, 1 terms in the infinite series for c 
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survive 

c{A, + v'.Aj, r) ^vl+ ^i'^'^J^K = E yWi'^'^iJh = vWi (5.35) 

|J| + |i^|>0 |/|>0 

So c((/), r) reduces to rj for infinitesimal so if we define x{G) = x(r) + c((/), F), we are 
guaranteed that UjX = ^/ which is what we wanted all along! 

To complete our construction of a classical action, we need to show that c(0, T) is 
a function of F and G = </>*F alone and is independent of the choice of (j) that maps 
one to the other. In other words, that c(0, F) is actually a function on V = Q/SQ. Is 
c{(j),T) invariant under the right action of the isotropy subgroup? Suppose ip G SQ, i.e. 
■0=kF = F Then we need to show 

c((/.V',F) =c((/.,F) (5.36) 

But this follows from the cocycle condition (|5.29|) if we can show that 

c(V',F) = if V*r = F (5.37) 

We establish this at the level of Lie algebras. In other words, suppose il}i{A) = Ai + v(Ai 
is infinitesimal, and fixes the distribution F. Then v(UjTj = VJ. On the other hand, 

c(V',F) = vlv} 

= viL}{SiTj) 

= S^iviL]Tj) 

= 5^(0) = (5.38) 

In the second equality we have used the factorized Schwinger Dyson 1)5. 1U|) satisfied by 
the reference action Sq and reference moments F. Thus c{(j), F) is invariant under the 
infinitesimal action of SQ. It should be possible to exponentiate this to an invariance 
under the full isotropy subgroup though we do not address this issue here. Thus we 
regard c{(p,T) as a function on the coset space V = Q/SQ and 

X{G) = x(F) + log det c/>i + 5: i^^^i<^^^^^3L2 . . . ~^ff^L^TK,...KjL^-L, • 

n=l 

(5.39) 
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We have finally solved our problem of finding a classical action for large- multi- 
matrix models! Suppose the distribution G is obtained by a change of variables (j) applied 
to the reference distribution F. Then extremization of ^{G) = —S^Gj + x{G): 

oo 

o(0,r) = •••</)/:rj,..j„+x(r) + iogdet,/.l,. 

n=l 

+ E ^^'^.'I^^^^V?'^^^ • • • <^,^"^"^"rK„...i.,rL,...^„ (5.40) 

n=l 

gives the factorized Schwinger Dyson equations (|5.10|) . This variational problem is equiv- 
alent to computing the partial Legendre transform of xiG): extremize x holding fixed the 
moments Gj which are conjugate to the couplings that appear in the action. The cou- 
pling constants play the role of Lagrange multipliers. Furthermore, ^2(1;^', T) = 
actually lives on the quotient V = Q /SQ. On account of rj not being an exact one-form, 
we could not find a power series in the moments Gj for x- But we have found auxiliary 
variables (0, T) that allow us to accomplish the task of finding a classical action. 

In principle, the problem of solving a large-A^ matrix model, given the coupling 
constants in its action, reduces to the extremization of $7 with respect to the change 
of variable (p for fixed reference moments Tj. Once the optimal (j) is determined, the 
moments are given by 

Gi,...i„=c^^---c^^Tj,...j^ (5.41) 

Furthermore, recalling the transformation rule for the free energy under an automor- 
phism (|5.20|) . the large- free energy in terms of the optimal (j) is 

F{S) - F{Sq) = - c{4)optimah^) 

=^ F{S) = — constrained extremum of x(G) (5-42) 

where the constrained extremum is with respect to all moments except the ones conjugate 
to the couplings . Also, Sq{A) is the reference action with moments F/. If tr S{A) is 
an action that grows as a polynomial as tr — > 00 so that the matrix integrals converge, 
then the extremum of Vt is actually a maximum and the free energy is a minimum. 

A byproduct of this is a variational approximation method for large- A^ matrix models. 
Rather than extremizing O with respect to all possible E if we choose an ansatz 
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for (j) with a few variational parameters and maximize 0, we will get a variational upper 
bound on the free energy and a variational estimate for the moments! As an application 
we use this method to solve a two matrix model approximately in H5.4I But before that, 
we explain the physical meaning of x- It is the entropy that comes from our lack of 
knowledge of matrix elements when only the invariants <!>/ = 77^7 are observable. 

5.3 X an Entropy 

The appearance of the Legendre transform, the logarithm of the jacobian for a change in 
volume measure and the restriction of observables to U{N) invariants should all remind 
us of entropy in thermodynamics. The non-exactness of r] is reminiscent of the heat 
one- form in thermodynamics j23j . 

In thermodynamics, entropy is a measure of the number of microstates that have a 
given value for the macroscopic observables such as energy, pressure or volume. Entropy 
arises when we restrict the set of allowed observables of a physical system. We should 
expect there to be an entropy due to color confinement in the strong interactions: quarks 
and gluons are confined and the only observable particles are their hadronic bound states. 

In a matrix model, which is a watered-down version of gluon dynamics, we have 
restricted the class of observables from matrix elements [Ai]^ to U{N) invariants 
There should be an associated entropy. Indeed it is the term x in the classical action 
we have just obtained. It is the average of the logarithm of the volume of the space of 
matrices that have a given set of moments. It can also be regarded as the expectation 
value of a matrix model analogue of the Fadeev-Popov determinant of gauge theories. 
Despite the analogies with statistical mechanics, there are a few differences. The concept 
of thermal equilibrium is not relevant to matrix models. What is more, we have an infi- 
nite number of macroscopic observables Gj while there were only a finite number such as 
energy, pressure, density, volume, and chemical potential in thermodynamics. Further- 
more, the microscopic dynamical variables, the matrices, do not commute as opposed 
to the real-valued positions of particles in a gas which commute. As a consequence, the 
entropy in matrix models is that of non-commutative probability theory rather than the 
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usual Boltzmann-Shannon entropy of commutative probability theory. Nevertheless, the 
Boltzmann notion that the entropy should be the logarithm of the volume of microstates 
corresponding to a given macrostate continues to makes sense. 

The interpretation of x ^ the entropy is most transparent for the case of a single 
matrix. The integral over the matrix elements of a hermitian matrix A = UDU'^ can 
be factored into an integral over its eigenvalues D = diag(Ai, • • • , A^v) and the unitary 
transformation U that diagonalizes it. The jacobian determinant for this change of vari- 
ables is the volume of the space of hermitian matrices which share a common spectrum 
Ai < A2 < • • • \n where the eigenvalues are generically distinct. The well known result 
dlis 

dvl = [] [| d YleAl d \uiAl = Vol{U{N)) [| dX^ (5.43) 

a. a<b a 

where Vol(U{N)) is the volume of the unitary group and A is the Van der Monde 
determinant A = na<b(-^a ~ ^^b)- Then the logarithm of volume of matrix elements cor- 
responding to a given spectrum is x = log A'^, up to the additive constant log Vol{U{N)) 
which we ignore since it is independent of the eigenvalues. If p{x) = J^a ^(^ ~ ^a) is 
the eigenvalue density, then the entropy is 

X = 2^1og|Aa-Afe| 

a<b 

= "P y p{x) p{y) log \x — y\ dx dy (5.44) 

where we have used the principal value in anticipation of the passage to the large- 
limit where we have a continuous distribution of eigenvalues. Notice that this formula 
for entropy is different from the Boltzmann entropy — / p{x) log p{x) dx of a real-valued 
random variable with density p. 

Now suppose Gk = J Pg{x) x^ dx are the moments of the probability density pc and 
Ffc are the moments of a reference distribution p^. Furthermore, let (j) be the change of 
variable that relates the two, 

Pt{x) = pg{4>{x))(I)' {x) 

Gk = j PT{y)cl>Hy)dy (5.45) 
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then the above formula for entropy becomes 

X{G) = V dxdy pG{x)pGiy) log \x - y\ 



x{G) = x{^)+'Pj dxdy Prix) priy) log 



(j){x) - (j){y) 



(5.46) 



x-y 

where we have multiplied and divided the argument of the logarithm by |x — y| so that 
we may isolate x(r). The second term on the right has the physical meaning of the 
relative entropy of G with respect to T. Moreover, if we assume the change of variable 
is an invertible power series 

oo 

(pix) = (pi[x + 4)ix)]; (f)i > 0; 0(x) = ^ (5.47) 

k=2 

the entropy becomes 

x{G)=x{T)+log(Pi + J2- E 4>kr+l+h---4>kMU'^kr+-kSh+-+ln (5-48) 

n=l ki+li>0 

We see that this is just the M = 1 case of the formula for x we obtained earlier 1)5. 39(1 
in looking for a classical action for large- matrix models. Formula (|5.46l) for entropy 
in terms of an eigenvalue density has no simple generalization to several non-commuting 
matrices. There is no common basis in which the matrices are diagonal. However, the 
joint spectrum of non-commuting matrices makes sense if we think in terms of the basis 
independent moments. The series in Taylor coefficients of for entropy continues to 
make sense. This is what we obtained in ()5.39|) . Thus c{(j),T) = xiG) — x(r) has the 
physical meaning of relative entropy of G with respect to T. For more on entropy of 
non-commutative random variables, see ^4.41 

In light of this, our variational principle from N5. 2. 31 has a simple physical interpre- 
tation. The moments of the large- classical limit of a matrix model are determined by 
maximizing entropy holding the moments Gi conjugate to the coupling tensors fixed 
using Lagrange multipliers . In looking for a variational principle for matrix models 
we have naturally been lead to the cohomology of automorphisms of the free algebra and 
to the entropy of non-commuting random variables! Let us now return to our original 
problem of determining the moments and free energy of large- matrix models. 
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5.4 Variational Approximations 

We next discuss the use of our variational principle to obtain approximate solutions 
of large-A^ matrix models. (For other approximation methods in the literature, see for 
example |H1] ) • A linear automorphism with the wignerian reference leads to an analogue 
of mean-field theory for matrix models. We closely follow our presentation in [51 169j . 
Consider a quartic multi-matrix model with action 

5(M) = tr i^K'U,, + ^g'^^'A.ki]. (5.49) 

Let us use the standard multivariate Wigner distribution (Appendix [nj with action 
5*0 = ^d''^ AiAj as the reference distribution. The reference moments are Tij = 6ij,Tijki = 
^ij^ki + Sii5jk etc. Let E = F{S) — F{So) denote the free energy of 5 relative to Sq. For 
a linear change of variables (pi (A) = <j)l Aj . We must maximize 

= tr logical] -^K'^G^,-^g'^''G^,M. 
^ij = 4>^4''ji Gijki = GijGki + GiiGjk (5.50) 

Thus Gij may be regarded as variational parameters and the condition for an extremum 
is 

^KP'^ + ^b^^^'Gn + g'^'^'Gi, + g^'^'^^k + g'^'^Gu] = ^iG'^r (5-51) 

This non-linear equation for the variational matrix G, is reminiscent of the self consistent 
equation for a mean-field. We will use the terms mean-field theory, gaussian ansatz 
and wignerian ansatz interchangeably for this variational approximation. To test it, we 
consider a two matrix model for which some exact results are known from the work of 
Mehta |7H]. 

5.4.1 Two Matrix Model with Interaction f (A^ + 5^) - 2cAB 

We will study the two matrix model whose action is 

S{A,B) = -tr[^{A'^ + B'^-cAB-cBA) + ^{A^ + B^)]. (5.52) 

Kazakov relates this model to the Ising model on the collection of all planar lattices 
with coordination number four |65j . The partition function of this model has been 
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calculated exactly by Mehta [TSj. This is a special case of the chain matrix models 
5(^1, • • - An) = X]fc=i ti' ^i^k) — J2k=i tr AkAk+i which also share this integrable 
property. Indeed, as A; oo, we can replace it by a continuous index. When the 
continuous index is thought of as time, we get the quantum mechanics of a single matrix 
whose ground-state was obtained by Brezin et. al. jl8j by mapping it onto a system of 
free fermions. 

We restrict to |c| < 1, where K^^ is a positive matrix. Since S{A^B) = S{B,A) and 
Gab = G*Qj^ we may take 

fa p\ 

G, ^ J (5.53) 

with a, (3 real. For (7 > 0, is bounded above if Gij is positive. Its maximum occurs at 
(a, 13) determined by /? = i_^2ga ^'^'^ 

Ag'^a^ + Aga"^ + (1 - - 25)0 -1 = 0. (5.54) 

We must pick the real root a{g, c) that lies in the physical region a > 0. Thus, the 
gaussian ansatz determines the vacuum energy {E{g, c) = — ^ log (a^ — and all the 
Green's functions (e.g. Gaa = ct, Gab = (3, Ga* = 2a^ e.t.c) approximately. 

By contrast, only a few observables of this model have been calculated exactly. Mehta 
|78j obtains the exact vacuum energy E^^{g,c) implicitly, as the solution of a quintic 
equation. and may be obtained by differentiation. (Generating series for some 
other special classes of moments are also accessible (see j971 131j ^.) As an illustration, we 
compare with Mehta's results in the weak and strong coupling regions. For small g and 

^'"(9, \) = --144 + 1.78(7 - 8.74^2 + . . . i?--(5, \) = -AAA + 3.565 - 23.75^ + • • • 

G7b{9, \) = \- 4-7% + 53.335^ + • • • GTb{9. \) = \- 4-745 + 48.465^ + • • • 
1 32 1 32 

G7aaa{9. 2) = Y - 34-965 + • • • GTaaa{9^ 2 ^ = T " ^^'^^^ + + ' ' ' 

For strong coupling and arbitrary c : 

E-^g, c) = 1 log5 + ^ log 3 - f + - - - i?--(5, c) = ^ log5 + J log 2 + ^ + O(-) 
2 2 4 2 2 -^85 g 

G7b{9, c) ^ as 5 - 00 GTb{9, c) = f " 7^ + 0{\) 

25 (25)2 r 
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Gaaaa{9,c) = - + ••• 



G7Xaa{9,c) = i - ^ + 0{^) (5.55) 
9 2a) 2 r 



The linear change of variable from a wignerian reference gives a crude first approximation 
for both weak and strong coupling. It fails near singularities of the free energy (phase 



gaussian ansatz. This reinforces our view that the gaussian variational ansatz is the 
analogue of mean-field theory. 

5.4.2 Two Matrix Model with Af + - [Ai, Ag]^ Interaction 

Next, consider the two matrix model with Yang-Mills -|- gaussian type of action (m^ > 0) 



Certain observables in the large- limit of the gaussian -|- commutator squared model 
(such as the partition function, though not Gabab ~ this can be seen via the planar 
diagram expansion) are the same as in the gaussian -|- anti-commutator squared two 
matrix model. Its partition function has been studied analytically in |351 171j in relation 
to the three color problem on a random lattice. The partition function of this model also 
arises in the study of the M = 1* super symmetric gauge theory, the mass deformation of 
the AA = 4 supersymmetric gauge theory upon integrating out one of the three adjoint 
chiral matter fields i28|. We can also regard this matrix model as a simple model for 
Yang-Mills theory. 

It has been shown ^U] that the integrals over matrices for the pure commutator 
squared action for a two matrix model are not convergent. To see this, consider the 
partition function, and go to the basis in which Ai is diagonal. In this basis, the integrand 
is independent of the diagonal elements of A2, and therefore diverges. The divergence is 
even worse if we consider the expectation value of the trace of a polynomial involving A2, 
since then the integrand grows for large values of the diagonal elements of A2- Moreover, 
from the work of [HSl 1^ , it is expected that the free energy develops a singularity for 
sufficiently large (or small m^). Since there is only one independent coupling constant 
we will set g = 1. 



transitions). As |c| 



1 , the energy E^^ diverges; this is not captured well by the 




(5.56) 
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In j69j we did a small-scale Monte-Carlo simulation to test our wignerian variational 
ansatz for this model. This ansatz does a good job of estimating 2 and 4 point correlations 
for m? > 1 but not in the strongly-coupled region of small m^. It does not capture 
the divergence in free energy for small m?, as we would expect of a mean- field type 
approximation. Our numerical simulation is also not expected to be accurate for small 
m^. We quote here the results from |69j . 

Polynomial Moments 

For a linear automorphism with standard wignerian reference So = ^6^^AiAj 

1 77?^ 1 

n[G] = -logdet[Gi,] - — (Gn + G22) + ^{Guu - G1221) (5.57) 
As before the variational matrix of second moments is 

Gij=\ " ^ (5.58) 
\P a J 

Since < ^Af > > and < 7^(^i - ^2)^ > > 0, we must maximize 

n(a, P) = ^ log(a^ - P^) - m^a + ^{(3^ - a^) (5.59) 
in the region a > and a > p. We get 



Gu = G22 = a = Jl + - -, Gi2 = G2i=P = (5.60) 

Figures 15.11 and 15.21 compare the variational two point correlations with Monte-Carlo 
measurements for a range of values of m?. All other correlations can be expressed in 
terms of these. For example, the 4-point correlations are (the rest are determined by 
cyclic symmetry and Ai ^ A2 exchange symmetry) Gim = 2q^;Gi212 = 0;Gi22i = 
«^;Giii2 = 0. Figures 15.41 and 15.51 compare variational estimates (solid lines) and 
Monte-Carlo measurements (dots) of Gim, G1212 and G1122 for 10~^ < rn^ < 10^. The 
n point pure Ai (or A2) correlation is given by the Catalan numbers 

Ga«2 = ^nMi-n_i_-|N, a2 , it u IS evcu: , 
G111...1 = G222...2 ^ G(„) = <; ' (2)!(i+i)! ^ y (5.61) 

D, if n is odd. 
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Figure 5.1: log^g [^ii] versus log;^Q [m^] for g = 1- Solid line is variational estimate, dots 
are Monte-Carlo measurements. The approximation becomes poor for small values of 



m 



G- 
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1 2 3 m 



Figure 5.2: G12 versus log^^Q [m?] for g = 1. Solid line is variational estimate, dots are 
Monte-Carlo measurements. 




Log m 



Figure 5.3: log^o [Gim] versus log^g ["^^] for g = 1. Solid is line variational estimate, 
dots are Monte-Carlo measurements. 



T 2 3 m 



Figure 5.4: G1212 versus log^g ["^^1 fo^' 5 = 1- Solid line is variational estimate, dots are 
Monte-Carlo measurements. 
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Log G1122 




Log m 



Figure 5.5: log^^Q [G1122] versus log^g [^^] for g = 1- Solid line is variational estimate, 
dots are Monte-Carlo measurements. 

More generally, 



Gii...i22---2 
Gii...i22---211---122---2 



G 



("l){"2){"3)(™4) 



G(ni+n-i)G(n2)G(ni) 



~l~^(ni)^(n3)^(n2+n4) 



(5.62) 



We mention these since they are useful in estimating expectation values of Wilson loop- 
like operators. The variational estimate for free energy is 

' -,2- 



E„ 



F{S) - F{So 



log a = — log 



'1 + 



(5.63) 



Wilson Loop-like Operators 

It is also interesting to see what the wignerian ansatz says about the 2-matrix analogue 
of the expectation of the Wilson loop in the large- limit. See |29[ E] for a calculation 
of these observables without any approximations in the one-matrix models. 
Wilson Line: The simplest analogue is a "Wilson line" the analogue of the parallel 
transport along a line of length I in the Ai direction 



N-*oo N 



(5.64) 



For the wignerian ansatz, we get (using (|5. 611) 1 

\2k 



Wuneil) 



Ji{2l^) . 



n=0 
1 



1 37r 

3- cos (— 2ly/a) as / — > 00 (5.65) 

/7r(/^)2 4 
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Wilson Line 



Figure 5.6: Wuneil) for = 1, g = 1. Dots are numerical and solid line variational 
estimate. 



Here Jn{z) is the Bessel function of the first kind. Wuneil) is a real- valued function 
of real / since the odd order correlations vanish. Thus, for the wignerian ansatz, the 
expectation value of the "Wilson line" is oscillatory but decays as a power For 
small /, Wiine{l) ^ 1 — ^aP + — • • •. Figure EHl compares this ansatz with Monte- 
Carlo measurements for m? = 1. The behavior for small and moderate values of / is well 
captured by our ansatz. 

L shaped Wilson Line: For an L shaped curve, we define 

tr 



Wl{1) 



lim < > 

N^oo N 



(5.66) 



For the wignerian ansatz (use eq. (|5.62|) : iF2{a, b; z) = J2'^= 



-n /I, \ ri, \ r is a generalized 

Hypergeometric function with (a)„ the Pochhammer symbol). 



^ ^ nx\ni\ N^oo N ^ ^ 

n.i, 712=0 ^ ^ 



E 



fci!(A:i + l)!A;2!(fc2 + l)! 

4"+ir(n + 



fcl,fe2=0 

oo 

^ r(n + l)r(n + 2)r(n + 3) 
iF2(|{2,3};-4Z2a) 



As before Wl(0 is real for real /. For small /, Wl{1) ^ 1 — al H — 
compared with the numerical calculation in fig. 15.71 for w? = 1. 



(5.67) 
This is 



W^ilson Square: The analogue of the parallel transport around a square of side / in 
the Al — A2 plane is 



square 



(0 



lim < — e 



ilAi ilA2„-ilAi 



-ilA2 



> 



(5.68) 
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Figure 5.7: for = 1, g = I. Solid line is variational estimate, dots are Monte- 

Carlo measurements. 



Wsquare Wilson Square 
1» 




2 3 4 5 6 7 



Figure 5.8: Wsquarei^-) for "i^ = 1, g = 1. Solid line is variational estimate, dots are the 
Monte-Carlo measurements. 

In the wignerian variational approximation, Wsquarei^) is real- valued since odd order 
correlations vanish. Using eq. (|5.62j) we get 



n=0 n=0 



(5.69) 



where Ti (n) 
T2(n) 



E 

ki>0,ki-i \-k4=n 

E 

A;i>0,fciH hA;4=n 



2Cfcj Cfc2 Cfc4 n|_]^Cfc^ 

nti(2fei)! 

Cfci+fc3 + lCfc2Cfc4 

(2A;i + l)!(2A;2)!(2A;3 + 1)1(2^4)! 



(5.70) 



For small /, Wsquarei^) 1 — + ^^g°^ — • • •. The expectation value of the Wilson loop 
is a rapidly decaying function for large values of /. It would be interesting to find the 
asymptotic rate of decay. It is oscillatory but a positive function, unlike Wune- These 
variational predictions are confirmed by the numerics, in fig. 15.81 for m?' = 1, g = 1. Our 
crude mean-field variational ansatz does a very good job of estimating the Wilson loop 
averages over a range of small and moderate values of / studied, for of order unity or 
more and 5 = 1. 
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Chapter 6 

Summary 

In this thesis, we studied rather simphfied models of a non-abehan gauge theory in the 
limit of a large number of colors (N). The emphasis has been on the classical nature 
of this limit, where, unlike in the usual h ^ classical limit, where all variables stop 
fluctuating, fluctuations vanish only in gauge-invariant variables. 

Part d focused on the baryon in two-dimensional QCD, and Part on euclidean 
multi-matrix models. 

Chapter ^ contained historical motivations, and an introduction to QCD, matrix 
models, and the large- limit. We illustrated a classical limit, different from the h ^ 
limit, in the context of the large-dimension limit for rotation- invariant variables in atomic 
physics, which provides a "classical" explanation for the stability of the atom. The 1/d 
expansion is an accurate way of calculating atomic energy levels in many electron atoms, 
as shown by Herschbach and collaborators |5Uj . 

Chapter|21summarizes Rajeev's reformulation of the large- limit of two-dimensional 
QCD in terms of color-singlet quark bilinears in the null gauge: two-dimensional Hadron- 
dynamics. This is an interacting bilocal theory of mesons, since the quark density matrix 
is a projection operator, 't Hooft's linear integral equation for the meson spectrum in 
two dimensions does not account for baryons. In Hadrondynamics, free mesons are small 
fluctuations around the vacuum, and baryons arise as topological solitons on the discon- 
nected infinite grassmannian phase space with connected components labelled by baryon 
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number. 

In Chapt. El we studied the structure of baryons in two-dimensional Hadrondy- 
namics. A version of steepest descent for the curved grassmannian phase space was 
provided to determine the ground-state of the baryon. Then, the ground-state of the 
baryon was determined variationally by minimizing its energy on submanifolds of the 
phase space. A succession of finite but increasing rank submanifolds of the phase space 
was found. The reduced dynamical systems on these submanifolds were shown to corre- 
spond to interacting-parton models. We thus derived the parton picture as a variational 
approximation to the soliton description. The rank-one separable ansatz corresponds 
to a Hartree-type approximation for valence-quarks. We determined the exact two- 
dimensional baryon form factor in the large- iV and chiral limits. The rank-three ansatz 
allowed us to estimate the sea and anti-quark content of the baryon, which turned out to 
be small. To model structure functions measured in Deep Inelastic Scattering, we used 
the two-dimensional valence-quark distribution as the initial condition for perturbative 

evolution. This allowed us to model the structure function measured in neutrino- 
nucleon deep inelastic scattering. We have two free parameters, the initial momentum 
transfer Qq, and the fraction of baryon momentum carried by valence-quarks. 

Chapter 13 provided background on free algebras and probability theory for operator- 
valued random variables. The tensor algebra, its automorphisms, and derivations were 
introduced in the context of non-commutative probability theory. Voiculescu's notion of 
non-commutative entropy was defined. 

In Chapter 13 we studied euclidean large- A'^ multi-matrix models as toy models for 
gluon dynamics. We formulated them as classical mechanical systems for U{N) invariant 
variables. We found that the classical configuration space is a space of non-commutative 
probability distributions. The classical equations of motion are the factorized Schwinger- 
Dyson equations for gluon correlations. There is an anomalous term in the factorized 
Schwinger-Dyson equations arising from the change in measure of integration. We showed 
that this term is a closed but not exact one-form. This universal term is an element of 
the Lie algebra cohomology of the automorphism group of the tensor algebra. This co- 
homological obstruction prevented us from finding a classical action principle as a power 
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series in gluon correlations. We showed that the configuration space can be expressed as 
a quotient of the automorphism group of the tensor algebra by the isotropy subgroup of 
measure-preserving automorphisms. Probability distributions are parameterized by the 
change of variable that maps them to a standard reference distribution. This provides a 
classical action on the group that is invariant under the action of the isotropy subgroup. 
The action principle was interpreted as the maximization of non-commutative entropy, 
while holding fixed the correlations conjugate to the coupling tensors. The restriction 
of observables from matrix elements to U (N) invariants is the source of this entropy. 
The free energy is the negative of the maximum value of this entropy. By treating the 
change of variable as a variational quantity, we obtain variational approximations for the 
correlations and free energy of large- matrix models. A linear automorphism is the 
simplest variational ansatz, and leads to an analog of mean-field theory. This approx- 
imation method gives reasonable estimates for the observables of one and two-matrix 
models, away from divergences of the free energy (phase transitions). 
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Appendix A 

Definition of Finite Part 
Integrals 

Here we discuss the definition of the singular integrals defining the potential energy 
in momentum space. In position space integrals this singularity is manifested in the 
linearly rising \x — y\ potential. Let us consider the rank-one case for example: 

PE = l^P J [dp] J ±-;i;{p)4,*{p + r)V{T) (A.l) 

where [dp] = dp/27r. In this appendix, we desire to give a meaning to the above "finite 
part" integral. We call it a "finite part" integral since similar "part finie " integrals appear 
in the work of Hadamard The self-consistent potential V{x) satisfies Poisson's 

equation V"{x) = \ip{x)\'^ along with a pair of boundary conditions, 

|V^(y)|2Mjy and ^'(0) = - / |V(-y)|' sgn {y)dy. (A.2) 
In momentum space Poisson's equation implies that 

V{p) = TV{^ f [dq]i^ip + qW{q)) ^ J^Vi-^W^p)). (A.3) 
The two boundary conditions take the form 

- J'V J^^ \w{p)[dp] = I my)\^^-^dy (A.4) 
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/Pi 1 i-OO 

-W{p)[dp] = - / {-my)\^ + m-y)\^}dy. (A.5) 
-p p 2 Jo 

In effect, the boundary conditions are rules for integrating non-singular functions with 
respect to the singular measures 2np^,i ■ We will use these conditions to define the "finite 
part" integral in terms of Riemann integrals. 

For simplicity, let us consider the case where the wave function is real. Then 
ip{—x) = ^*{x), and the self-consistent potential is even: V{—p) = V{p). Then the 
second boundary condition (jA.SI) becomes 

rP -1 . 

J^V — W{p)[dp]=0. (A.6) 
J-p P 

Since the integrand is odd, this condition is automatically satisfied. Let us restrict 
attention to wave functions ipip) such that 

~ p", as p ^ 0, for some a > 0. (A. 7) 

From the Frobenius analysis (section we know that this is equivalent to assuming 

a non-zero current quark mass, so that superficial infrared divergences are avoided. 

Our aim is to define -J^V J^p j^W{s)[ds] so as to satisfy the first boundary condition. 
Moreover, the definition should reduce to the usual one, when this quantity is finite to 
begin with. 



Claim: If 



Then 



I p ^W{s)[ds\ := [ds] - (A. 



/P 1 roo |™| 

W{s)[ds] = - (A.9) 
-P J-oo 2 

provided VF'(O) = 0, which is ensured if ^'{p) is real and from the right at p = for 
some a > 0. 

Proof: The motivation for this definition is clear: we have essentially subtracted out 
the divergent terms and analytically continued the result that we would have got if 
W{s) vanished sufficiently fast at the origin (i.e. like s^"*"*^, e > 0) to make the integral 
converge. However, there is always the danger of mistakenly adding/subtracting some 
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finite quantity along with the divergent one. We show here that this definition actually 
satisfies the boundary conditions. 

Recall that W{s) is the Fourier transform of the charge density: 

/oo 
|V'(x)|^e-*^^dx (A.IO) 
"OO 

so that 

/oo 
(ix|V'(x)|2(e-**^-l) (A.ll) 
-oo 

Moreover, VF'(O) vanishes, since 

POO 

W'{0) = -i x\ip{x)\'^dx (A.12) 



and the integrand is odd. Therefore, W{s) — W{0) vanishes at least as fast as s^^*", e > 
as s ^ 0. E.g. for ijj{p) = Ap"e~P, W{s) = 1 - const s^"+^ + O(s^). Therefore, 

'•P W{s) - VF(0) 



p 



■[ds] (A. 13) 



exists as a Riemann integral. Since the integrand is even it suffices to consider the case 

f Ml^H = f 5^/;j*WWe--l) (A.14) 

Using the fact that only the even part of (e~*'^^' — 1) contributes to the integral over x, 
we change the order of integration, 

'^^'^-^^'\ds]=r dx\m\' r A(cos.x-l). (A.15) 



7„oo JQ 27rs2 

The inner integral can be performed in terms of the sine integral function Si. Assuming 
that '4){x) is normalized to one, we have 

/ 2 - = - / dx\ilj{x)\ v{x) (A.16) 

Where, 

v{x) = ^— (cos (Px) + PxSi{Px)) (A.17) 
2'kP 

The asymptotic expansion of Si{t) for large t is 

Si{t) ~ I + cos + 0(i)) + sint(^ + 0(i)) (A.18) 
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Since the sine integral is odd, we have 

, , Ixl 1 ^./T-, N -Pblvr ,^ Ixl Remainder(Px) ,. 
v{^) = Y + - + cos(Px)) = Y + ^^^TT^ (A.19) 

We have the desired result, except for the remainder term: 

2 — ^ = -/ hAO^ 



1 f^^ 

— |V'(2;)P Remainder(Px)(ix. (A.20) 



In the P = oo hmit, the remainder term vanishes since —1 < Remainder(t) = tSi{t) — 
™ < 1. For finite P, Remainder(t) ~ ^^f^^, \t\ ^ oo is an oscihatory function, and we 
expect the remainder term to be smah due to cancehations. In fact, it is zero. Consider 

DO 

dx\ip{x)\'^ Remainder(Px) 

oo 

= [dq] [dr]ip{q + r)ip*{q) dxe*™ Remainder (Px). (A.21) 

Jo J — Q J— oo 

Here Remainder (t) is an even function and 

/oo roo 
dxe^^^ Remainder(Px) = 2 / dxcos{rx) Remainder(Px) = 0, (A. 22) 

-oo Jo 

from the properties of the sine integral, provided |r| < P, which is the region of interest. 
Thus the "Remainder" term vanishes, and we have shown that our definition of the 
"finite part" integral satisfies the boundary conditions. This justifies our definition 

J-P j-p ttP 

when W'i^d) = 0. 
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Appendix B 



One Matrix Model 



B.l Equations of Motion and Classical Action 

The observables of the single matrix model are the U{N) invariants = of a 

single N X N hermitian random matrix A. Their expectation values are obtained by 
averaging with respect to the Boltzmann measure ^e~^^^ ^^^^ for a polynomial action 
S{A) = J2ti=i 'SnA'". The large-A^ limit is a classical limit where the fluctuations vanish 
{^n^m) = {^n){^ni) + 0{^). In this limit, the large- moments Gn are a complete 
set of observables. 

Af^oo^A^ ' N~*ooZn{S)J N 

Zn{S) = /dAe-^*'-^^^); FiS) = - lim ^Zn{S) (B.l) 

Gn parameterize the configuration space of the theory, which is the space of non- 
commutative probability distributions in a single operator- valued random variable: Vi. 
The classical equations of motion are the factorized Schwinger Dyson or loop equations 

J2lSiGk+i= GpGq:=rjk, k = 0,1,2,- ■■ (B.2) 

I p+q=k 

These are recursion relations for the higher moments Gn, n > m given the moments, 
Gi, G2, • • • Gm conjugate to the coupling constants Si, - ■ ■ Sm- The factorized loop equa- 
tions are obtained by requiring the partition function to be invariant under the infinites- 
imal non-linear change of variables : A^ A + eA*^"*"^, k = 0,1, - ■ ■ . (The case k = —1 
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also leads to a valid equation, the RHS of HB.2I) vanishes in this case, from the translation 
invariance of the measure.) The LHS comes from the change in S{A) and the RHS rjk 
is an anomaly coming from the change in the measure. The vector fields Lk act on the 
moments 

d 

LkGi = IGk+i; Lk = y^P^k+p^pr (B-3) 

p C'Gp 

and satisfy commutation relations of the Virasoro algebra [Lp, Lq] = {q — p)Lp^q 

Since {^p{A)f{A)) > for any polynomial f{A) = fnA"^, the Hankel matrix Gp^q = 
Gp^q must be non-negative. This ensures that there is a probability density on the 
real line p{x) whose moments are G„ = / p{x) dx. In fact, p{x) is the density of 
eigenvalues of the random matrix A, as discussed in ^5.31 The factorized loop equations 
turn into a linear integral equation for p{x), which we call the Mehta-Dyson equation 




(B.4) 



The factorized loop equations can be obtained by multiplying either side by z^~^^p{z) 
and integrating with respect to z. The Mehta Dyson equation follows from maximizing 
the action 

= X - / pi^)Si.V^: X-Vj log |x - yW),(y)i.^ (B.5) 

with respect to p. As explained in ^5.31 x is the entropy due to our lack of knowledge 
of matrix elements of A when we restrict measurements to the U {N) invariants of A. 
The simplest possibility is a quadratic action S{A) = ^^4^ which leads to the Wigner 
semicircular distribution p{x) = 2^-\/[4 — < 2) discussed further in Appendix 

10 The maximization of i7 involves a balance: x tends to spread the eigenvalues out 
due to the logarithmic repulsion, while — / p{x)S{x)dx is maximized if the eigenvalues 
are clustered near the minima of S{x). For polynomial actions such that S{x) ^ oo as 
|x| — > oo, the eigenvalue distribution is supported over a finite number of intervals on 
the real line, located around the minima of S{x). These intervals are the branch cuts 
of the moment generating function G{z) = ^n>oGnZ~^''~^ . p{x) is the discontinuity of 
— ^G{z) across its branch cuts. 
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Can r2(p) be expressed in terms of the moments in order to serve as a classical action 
principle for the factorized loop equations ()B.2|) ? i.e. Lk^{G) = rjk — J2i>o^'^l^k+l'^ 
Clearly, Lk(^J2i>i 'SiGi^ — J2i>i^SiGk+i- However, there is no power series in the 
moments x{G) such that Lkx{G) = rik{G) . This is because log |x — y\ cannot be 
expressed as a power series in both x and y. This is a symptom of a cohomological 
obstruction, r]{G) is not exact, but is a closed one-form valued in the space of power 
series in the moments. The components of the one-form rj are obtained by acting on the 
basis vector fields r]k = vi^k)- Now, 



dr){Lp, Lq) = Lpr]{Lq) - LqT]{Lp) - r]{[Lp, Lq]) 

= LpTJq - LgTIp -{q- p)Vp+q 



(B.6) 



and it is a straightforward calculation to check that drj = 0. 

We can get around this obstruction by expressing both the moments Gk and x in 
terms of auxiliary variables. Suppose pr is a reference probability distribution and (p 
invertible change of variables that maps pr to pc 



Pr{x) = PG{x)(t>'{x) 
Gk = J pr{y)(tJ"{y)dy 

x{G) = xi'r) +'P J dxdy Prix) priy) log 

4>{x) - (piy) 



x-y 



x-y 



:= x(r) + ^log 
If (/) is a formal power series 

oo 

Then, 



(B.7) 



n=l 



i[x + (f){x)]; 4>{x) = (i)kX^ 

k=2 



x{G) = x(r) + log 01 ^ — - — Yl 4>ki+i+h---(t)k,^+i+irXki+-kSh+-+i^ 



Gk - Yl 

p=k niH hnfe=p, ni>l 



Ti 

n=l ki+li>0 
'ni ■ ■ ■ 4'nk 
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n{G) 



O(0,r)=x(G)-^5fcGfc 



(B.9) 



k>l 



What we have done is to parameterize the configuration space Vi by automorphisms of 
the algebra of power series in one variable Gi = AutTl which transform between prob- 
ability distributions. Actually, we should restrict to the subgroup Gi of automorphisms 
that preserve the moment inequalities, i.e. the positivity of the probability distributions. 
In the case of a single generator that we are considering, the only measure-preserving 
automorphism is the identity, i.e. the isotropy subgroup SGi is trivial. So we may 
identify Vi with Gi- In other words, the moments Gk and the Taylor coefficients of the 
automorphism both serve as coordinates on the classical configuration space. The 
left action of the vector fields on is (see ^4.3(1 



h+h+-+h+i 



k + 1 



(B.IO) 



Now we can check explicitly that L^xiG) = ilk{G) 



LkX 



fc<PJn- 



d 



E[^fc'^]"7^( log 



n 

x" — y" \ 



(x) - (f)(y) 



x-y 



GpGq = rjk 

p+q=k 



- 7/" \ 

{x)-ct>{y)/r 



(B.ll) 



Thus, we have a classical action 0,{(p, T) = x{G) — J2n SnGn whose extremization leads to 
the factorized loop equations. This leads to a variational principle to solve the classical 
equations of motion: pick a reference distribution T such as the Wigner distribution (see 
AppendixEl); maximize entropy x with respect to the automorphism (j)^ while holding the 
moments G„ conjugate to the coupling constants fixed (via the Lagrange multipliers Sn)- 
The free energy F is the negative of the maximum value of entropy. As a variational 
ansatz, we may pick to be a polynomial, its Taylor coefficients are the variational 
parameters. They parameterize a finite dimensional space of probability distributions in 
the neighborhood of T. The extremization of will give the probability distribution in 
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this finite dimensional family that best approximates the exact moments of the action 
S{A). We give below a variational approximation for the quartic one-matrix model. 

B.2 Mean Field Theory for Quartic One Matrix Model 

The exact moments and free energy for the quartic one-matrix model were obtained by 
Brezin et. al. JH]- We use this as a way of calibrating our variational approach 



Z{g) = J dAe 



N tr [\A^+gA^] 



E(,) = i.(,)-f(0) = -Jim ijlogfM (B,12) 

We take the Wigner distribution with r2 = 1 as our reference (see Appendix [CI). Pick a 
linear automorphism (t){x) = (pix, which merely scales the width of the Wigner distribu- 
tion. The <j)i that maximizes il. represents the Wigner distribution that best approximates 
the quartic matrix model. 

= log 01 - ^G2((Ai) - gG^icl^i) (B.13) 
Here G2k = (l>i'^2k- Let a = i7(a) = ^ logo — f — '^go^ is bounded above only for 



g > 0. The maximum occurs at a{g) = ■ Notice that a is determined by a 

non-linear equation. The linear change of variable ansatz from a wignerian reference is 
an analogue of mean-field theory for large- matrix models. Our variational estimates 
are (C^ are Catalan numbers, see Appendix IH]) : 

= _i -i + y/ms;? 

2 ^ 16g 
= + (B.14) 

The exact results from |18| are: 

EeAg) = ^{a\g) - 1){9 -a\g))- hog {a\g)) 

GfAg) = J^^^^/ \g)[2k + 2-ka\g)]. (B.15) 

where a?{g) = 2i^[— 1 + Vl + ^^r^J. In both cases, the free energy is analytic at = 
with a square root branch point at a negative critical coupling. The mean-field critical 
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Figure B.l: Free energy E versus g. Curves top to bottom are: mean-field ansatz, 
(l){x) = (pix + (psx^ ansatz calculated to first order beyond mean-field theory, and the 
exact energy. 

coupling g^^^ = — ^ is 50% more than the exact value g^^ = ~ ^ • The variational and 
exact free energies are compared in figure iBiTl In the weak coupling region 

Eexig) =2g- 18g^ + ... E{g) = Ag - 48g^ + ... 
GlM = 1 - 85 + UV + • • • G'{g) = l-Sg + l2Sg^ + ... 

GtM = 2 - 36<7 + . . . G\g) = 2 - 32g + . . . (B.16) 

and asymptotically, as gf — > 00 

^ . X 1 , log (144) - 3 1 , 1 1 , 1 

The distribution of eigenvalues of the best wignerian approximation is given by Pg{x) = 
(pi^ PY'((f>i^x) where pr{x) = ^\/4 — x^, < 2 is the reference Wigner distribution. 
The exact distribution 

Pex{x,g) = -{^ + 4ga\g) + 2gx'')J4a^g)-x^, \x\<2aig). (B.18) 



is compared with the best wignerian approximation in figure |B21 The latter does not 
capture the bimodal property of the former. The wignerian ansatz is like mean-field 
theory, but is not restricted to small values of the coupling g. To improve on this, get 
a non-trivial estimate for the higher cumulants (connected moments) and capture the 
bimodal distribution of eigenvalues, we need to make a non-linear change of variable. 
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Eigenvalue Distribution 



0^ 




Figure B.2: Eigenvalue Distribution. Dark curve is exact, semicircle is mean-field and 
bi-modal light curve is cubic ansatz at 1-loop. 

B.3 Beyond Mean Field Theory: Non-linear Ansatz 

To go beyond mean-field theory while sticking to the standard Wigner distribution as 
the reference, we must make a cubic change of variable. = (j)ix + (j)sx^ . A quadratic 
ansatz will not increase il(</>) since S{A) is even. 0i_3 are determined by the condition 
that is maximal 



m = X-\G2-gG^ 
x[(t>] = I dxdypr{x)pr{y)\og 



4>{x) - 4>{y) 



x-y 

Ga = J dxpT[x)<P\x) = ct>\T^ + 40?(/<3r6 + 60?02pg ^ 40i</>iriO + 0|ri2 

G2 = j dxpv{x)4>\x)=4>lT2 + 2cpiCp^Ti + <t)lTQ (B.19) 

We were not able to evaluate the integral for x exactly. For multi-matrix models, the 
problem is worse, since we do not have such a closed form expression for x ^^id have to 
rely on the power series 1)5. 39(1 . However, considering the success of the linear change 
of variable, we expect the deviations of ^1^3 from their mean-field values {^/a, 0) to be 
small. We evaluate VL using the power series ()B.9|) and solve for the corrections to (/)i_3 
by linearizing around their mean-field values. This may be regarded as a kind of 1-loop 
correction beyond mean-field theory, though it is not restricted to small values of the 



coupling g. Within this approximation, we get (with a = ^^i6g^^ ~) 

^/a{-2, + 2a + {l- 'i2g)a^ + A^ga^ + 14V a^) 



a 



3 + 4a + (1 965)a2 _^ ^^g^y^z + 432520-4 
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Connected 4 point function 

J 2 3 i 5 ' 




Figure B.3: Connected 4* moment (cumulant). Dashed curve is estimate from cubic 
ansatz. Solid curve is exact. Mean field estimate was identically zero 



3 + 4a + (1 + 965)q2 + 48ga^ + 43252a4 ' ^ ' ' 

from which we calculate the variational moments and free energy. Comparing with the 
exact results of [18] . we find the following qualitative improvements over the mean-field 
ansatz. 

In addition to the mean- field branch cut from — oo to g^^^ , the vacuum energy now 
has a double pole at gf^ < gl~^°°^ = "^"^^"gg^ < g^"". We can understand this double 
pole as a sort of Fade approximation to a branch cut that runs all the way up to g^^. 
The vacuum energy variational estimate is lowered for all g. 

Figure E2] demonstrates that the cubic ansatz is able to capture the bimodal nature 
of the exact eigenvalue distribution. If ip{x) = then p{x) = pQ{ip{x))ip' (x), where 

Poix) = ^V4-2;2, |x| < 2. 

The moments G2, G4 are now within a percent of their exact values, for all g. More 
significantly, the connected 4-point function G4 = G4 — 2(^2)^ which vanished for the 
wignerian ansatz, is non-trivial, and within 10% of its exact value, across all values of g 
(figure Ell). 
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Appendix C 

Single and Multivariate Wigner 
Distribution 

The analogue of the gaussian for operator-valued random variables is the Wigner semi- 
circular distribution. Indeed, the Wigner distribution of matrices is realized when the 
matrix elements are distributed as independent gaussians. The Wigner distribution 
derives its importance from the fact that it maximizes entropy (see ^4.41 15. 3|) among all 
non-commutative probability distributions with given variance. We collect some facts 
about the Wigner distribution here. This section relies on some results on matrix models 
which can be found in Appendix El and Chapter El 

Let us first consider the case of a single hermitian N x N matrix A whose elements 
are gaussian distributed with zero mean and unit variance. The moments of A in the 
large- limit are 

r„= lim — /dAe-^t'^^^'-A"; Zn = /dAe"^*'^^^' (C.l) 
N^oo Zn J N J 

The Wigner moments satisfy the so-called factorized loop equations (AppendixEl) which 

are recursion relations 

Tn+i = ^ TpFg (C.2) 

p+l+q=n 

They may be solved in terms of Catalan numbers Cn 

(2nY 

r2n = Cn= ,; r2„+i = 0; n = 0,l,2--- (C.3) 

n!(n + 1)! 
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The first few are Fq = 1, T2 = 1, = 2, Fg = 5. The Wigner moments are re- 
ahzed by the semicircular distribution of eigenvalues = J p{x)x^dx. Maximizing 
non-commutative entropy f M.41 holding the variance fixed 

= ^(^J dxdy p{x)p{y) log \x — y\ — - J dxp{x)x'^ 
^ z = 2V [ -^dx (C.4) 

JR Z - X 

gives the Wigner semi-circular distribution p{x) = 9{\x\ < 2)-^\/4 — x"^ |18j . 

The analogue of the gaussian process and heat equation of real-valued random vari- 
ables are the semi-circular process and complex Burger equation in non-commutative 
probability theory P^. A semi-circular process starting at X is a time dependent 
operator-valued random variable Y{t) = X + \/tS where S is semicircular < S"" >= r„. 
Let G{z,t) = E 

n>o '^^l+i ^ be the moment generating function of Y{t). Then, G{z,t) 
satisfies the analogue of the heat equation, the complex Burger equation: 



The standard multivariate Wigner distribution corresponds to a gaussian M matrix 

2 



model with action Sq{A) = ^ J2i=i the large- limit 



Ti = hm /dAe-i^^'^f-^-^.^ 

Zn = IdAe-^^^"*--^'^^ (C.6) 
The odd moments vanish as before and the second moment is 

Tij = Sij (C.7) 
The factorized loop equations H5.1U() are again recursion relations 

r,, = rjid'/''^ri,Tj, (c.s) 

involving the order-preserving partition of the original string I into sub-strings. By 
iterating, we may reduce the evaluation of an arbitrary moment to a non-crossing sum 
over products of the second moment 

7reNCP2(n-j„) {ja,*(,}e7r 
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This is Wick's theorem for Feynman diagrams of planar topology. NCP2(«i, • • • ,in) is 
the set of non-crossing partitions of the cychcally symmetric set of indices (ii • • • in) into 
pairs. For example 

Tij = 5ij 

^ijkl — ^ij^kl ~l~ ^il^jk 
^ijklmn — ^ij^kl^mn ~l~ ^ni^jk^lm ~l~ ^il^jk^mn ~l~ ^ij^kn^lm ~l~ ^ jm^ni^ kl{C -^0) 

The Wigner distribution serves as a candidate for a reference distribution from which 
other distributions may be obtained by a change of variable. For example, a Wigner 
distribution with positive covariance Gij = [K^^]ij which arises from the action S{A) = 
^K^^AiAj can be obtained from the standard one. Use the invertible linear transforma- 
tion Ai I— > (1)1 Aj where Gij = Tkicjj'lcf)^ By the polar decomposition, the space of Wigner 
distributions is the coset space GL{M)/0{M). 

We can use a similar change of variable along with our variational principle to find the 
best wignerian variational approximation to a matrix model. It is a kind of mean-field 
theory, as discussed in ^5.41 
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Appendix D 



Anomaly is a Closed 1-form 



The conditions for 7/ to be a closed 1-form are ()5.16j) 



(D.l) 



where 77/ = 6j G^^G^^ . Let us check that these are satisfied. The RHS of H5.16() is: 



RHS 



G^^ G^^ 



"JiiJ2y"KijP'^K2 +"PjQ"Ki'^K2 +"PjK2"Ki 



{i ^ j,I ^ J) 



"KijJiiJ2<^K2 + °JliJ2jK2°K^ 



(i ^ j,I ^ J) 



We cancelled the middle terms using acyclicity of G^ . On the other hand 



(D.2) 



^J\jJ2\.^KiiK2 



qK\IK2qJ2 _j_ ^^^2 f-iJl qKiIK21^ 



"JliJ2jJ3 



/^JiIJ2rfJz _i_ xJ r-<Jir-'J2lJ3 



^ ^ ["JliJ2ii^2"Xi + "KijJii.J2'^K2 



■hiJ2-\ 



(D.3) 



Thus the LHS of (06j) is: 



Comparing ()D.4|) and (|D.2|1 . we see that the integrability condition (|5.16|) is satisfied. 
Thus, rjl is a closed one-form. 
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Appendix E 

Group Cohomology 

It is natural in mathematics to study the cohomology of a group twisted by a represen- 
tation [7^1 ISS]. Given a group G and a representation of G on a vector space V {V is 
called a G module), we can define a cohomology theory. We will sometimes call this the 
cohomology of G valued in V. The r-cochains are functions 

c:G''^V. (E.l) 

The coboundary d is defined on r-cochains as 

dc{gi,g2,--- ,gr+i) = gic{g2,--- ,gr+i) 

r 

+ '^{-'^Yc{gi,g2,- ■ ■ ,gs-i,gsgs+i,gs+2,- ■ ■,gr+i) 

+(-ir+ic(<7i,---,9r). (E.2) 

and satisfies = 0. A closed cochain {dc = 0) is called a cocycle. An exact cochain 
(6 = dc for some c) is called a coboundary. The r^^ cohomology of G twisted by the 
representation of G on V, H'^{G,V) is the space of closed r-cochains modulo exact 
r-cochains. 

For example, H^{G,V) is the space of all G-invariant elements of V, i.e. the space 
of V £ V satisfying gv — v = foi all g G G. A 1-cocycle is a function c : G ^ V that is 
closed, 

dc{gi,g2) = gic{g2) - cigm) + c{gi) = 
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I.e. 



cigm) 



gic{g2) + c{gi). 



(E.3) 



The space of solutions to this equation modulo 1-coboundaries (which are of the form 
b{g) = gv — V for some v V) is the first cohomology H^{G, V). If G acts trivially on 
V, a cocycle is just a homomorphism from G to the additive group of V: 0(51^2) = 
c(52) + c{gi). 

As an example consider the loop group G = S^G' = {g : ^ G'} of a Lie group 
G' . And let V = S^G[ be the loop of the Lie algebra G. Then there is the adjoint 
representation of G on V: adgV = gvg~^. A non-trivial 1-cocycle is c : G ^ V, c{g) = 
gdg~^ . It satisfies the co-cycle condition (|E.3|) 



c{gh) = ghd{h^^g^^) = g[hdh ^]g ^ + ghh ^dg 



1 



a.dgc{h) + c{g). 



(E.4) 
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Appendix F 



Formula for Entropy 



Here we show that (^{(f), A) = ^ log det J{4>, A) can be expressed in terms of the loop 
variables and the coefficients of the automorphism (j)^. The jacobian matrix of the 
change of variable 

cP{A), = [cPi]i4>{A)y, 4>{A), = A,+ Y.4>iAi 

l^l>2 



m+ra>l 

= [^i]'[slScSU+Kj;j^{A)^ (F.l) 
Or suppressing color indices, 

j/(<^,A) = [<Ai]f{<5il®l+ E <^i'''^/®^j}^[0i]f{<^il^l + i^^(A)} (F.2) 



Then 



|/|+|J|>1 



a{(P,A) = l^logdet[M^ + K{A))] 

(_-i\n+l 1 

= logdet</.i + E^^ ^t^^" 

n>l 



So we calculate 



Appendix rn Formula for Entropy 



137 



Thus 

a(<^, A) = log det <Ai + ^ ^^^i^i»2Li^x,.,L, . . . ^^^^^ (^.5) 

n>l " 
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